COMPOSITIONS, METHODS, AND KITS USEFUL FOR THE
DIAGNOSIS AND TREATMENT OF SPINAL MUSCULAR ATROPHY

CROSS-REFERENCE TO RELATED APPLICATIONS 
This application claims priority pursuant to 35 U.S.C. § 119(e) to U.S.
Provisional Application No. 60/100,866, filed on September 17, 1998. 

STATEMENT OF FEDERALLY SPONSORED RESEARCH AND
DEVELOPMENT
This invention was supported in part by funds from the U.S. 
Government (National Institutes of Health Grant No. R01-GM37125) and the U.S. 
Government may therefore have certain rights in the invention.

BACKGROUND OF THE INVENTION 
Spinal muscular atrophy (SMA) is characterized by degeneration of the 
anterior horn cells of the spinal cord, leading to progressive symmetrical limb and
trunk paralysis and muscular atrophy. SMA is the second most common fatal
autosomal recessive disorder, second only to cystic fibrosis, and the most common 
genetic cause of childhood mortality affecting 1 in 6,000 newborns (Roberts et al., 
1970, Arch. Dis. Child. 45:33-38; Pearn, 1973, J. Med. Genet. 10:260-265; Pearn, 
1978, J. Med. Genet. 15:409-413; Czeizel and Hamular, 1989, J. Med. Genet. 21:761-763).
Childhood spinal muscular atrophies are divided into severe (type I, Werdnig-Hoffman
disease) and mild forms (types II and III) according to the age of onset and the
severity of the disease (Munsat, 1991, Neuromusc. Disord. 1:81; Crawford and Pardo, 
1996, Neurobiol. Dis. 3:97-110). The Survival of Motor Neurons (SMN) gene
(Lefebvre et al., 1995, Cell 89:155-165) has been shown to be the SMA disease gene,
and it is deleted or mutated in over 98% of SMA patients (Bussaglia et al., 1995, Nat.
Genet. 11:335-337; Chang et al., 1995, Am. J. Hum. Genet. 57:1503-1505; Cobben et
al., 1995, Am. J. Hum. Genet. 57:805-808; Hahnen et al., 1995, Hum. Mol. Genet.
4:1927-1933; Hahnen et al., 1996, Am. J. Hum. Genet. 59:1057-1065; Lefebvre et al., 
1995, Cell 89:155-165; Rodrigues et al., 1995, Hum. Mol. Genet. 4:631-634; Velasco 
et al., 1996, Hum. Mol. Genet. 5:257-263; Lefebvre et al., 1997, Nat. Genet. 16:265-269).

Two inverted gene copies of the SMN gene are located in a 500 kb 
inverted repeat at chromosome 5q13. In over 98% of all SMA patients, the telomeric
copy of SMN (SMN^T) is deleted or mutated while the centromeric copy of the gene
(SMN^C) is unaffected (Lefebvre et al., 1995, Cell 89:155-165).
The SMN gene encodes a protein of about 296 amino acids having a molecular mass of
approximately 40 kDa. The sequence of the protein does not exhibit any significant 
homology to any other protein of known function in the currently available protein 
databases. 

Recently, in the course of studies of the functions of heterogeneous
nuclear ribonucleoproteins (hnRNPs) (Dreyfuss et al., 1993, Ann. Rev. Biochem.
62:289-321), it was found that the SMN protein interacts with fibrillarin, an RNA- 
binding protein involved in rRNA processing, and with several other RNA-binding 
proteins (Liu and Dreyfuss, 1996, EMBO J. 15:3555-3565). Monoclonal antibodies to 
SMN localized the protein to a unique cellular location. SMN exhibits a general
localization in the cytoplasm and is particularly concentrated in several prominent
nuclear bodies called gems (for gemini of coiled bodies). Gems are novel nuclear 
structures which are related in number and size to coiled bodies and are usually found 
in close proximity to them (Liu and Dreyfuss, 1996, EMBO J. 15:3555-3565). Coiled 
bodies, which were first described by Ramon y Cajal (1903, Trab. Lab. Invest. Biol.
2:129-221), are prominent nuclear bodies found in widely divergent organisms,
including plant and animal cells (Bohmann et al., 1995, J. Cell Sci. 19:107-113; Gall et
al., 1995, Dev. Genet. 16:25-35). Coiled bodies contain the spliceosomal U1, U2,
U4/U6, and U5 snRNPs, U3 snoRNAs, and several proteins, including the specific 
marker p80-coilin, fibrillarin, and NOP140 (Bohmann et al., 1995, J. Cell Sci. 19:107-113,
and references therein; Gall et al., 1995, Dev. Genet. 16:25-35). Expression of
p80-coilin mutants and microscopic observations suggest a close association between
coiled bodies and the nucleolus (Raska et al., 1990, J. Struct. Biol. 104:120-127;
Andrade et al., 1991, J. Exp. Med. 173:1407-1419; Bohmann et al., 1995, J. Cell Biol.
131:817-831). However, the specific functions of coiled bodies are not clear. Current
ideas propose that coiled bodies may be involved in processing, sorting, and assembly
of snRNAs and snoRNAs in the nucleus. The close association of gems and coiled
bodies raises the possibility that the SMN protein and gems are also involved in the 
processing and metabolism of small nuclear RNAs (Liu and Dreyfuss, 1996, EMBO J. 
15:3555-3565). 

The Sm class of small nuclear ribonucleoproteins (snRNPs) U1, U2,
U4/6, and U5 are major constituents of the spliceosome, the catalytic center of the
pre-mRNA splicing reaction (Moore et al., 1993, In: The RNA World, pp. 303-358,
Gesteland and Atkins, eds., Cold Spring Harbor Laboratory Press, Plainview, NY; 
Madhani and Guthrie, 1994, Annu. Rev. Genet. 28:1-26). Each spliceosomal snRNP 
consists of one (U1, U2, and U5) or two (U4/6) snRNAs, a common set of at least eight
Sm proteins, termed B, B', D1, D2, D3, E, F, and G, and specific polypeptides that are
associated with only one individual U snRNP (reviewed by Lührmann et al., 1990,
Biochim. Biophys. Acta Gene Struct. Express. 1087:265-292). With the exception of 
U6, all spliceosomal snRNAs share two structural features: the 5'-terminal
trimethylguanosine (m3G) cap and a short, single-stranded, eight-to-ten nucleotide
uridine-rich sequence flanked by two hairpin loops, referred to as the Sm site (Branlant
et al., 1982, EMBO J. 1:1259-1265; Reddy and Busch, 1988, In: Structure and 
Function of Major and Minor Small Nuclear Ribonucleoprotein Particles, pp. 1-37, 
Birnstiel, ed., Springer-Verlag, Berlin). The Sm site is the primary binding site for the
Sm proteins. The remaining snRNA domains provide binding sites for the
snRNA-specific snRNP proteins and for RNA-RNA interactions (Lührmann et al., 1990,
Biochim. Biophys. Acta Gene Struct. Express. 1087:265-292). U6 differs from the
other spliceosomal U snRNAs in that it contains a γ-monomethyl cap instead of the
m3G cap and does not bind directly to Sm proteins due to its lack of an Sm site
(Reddy and Busch, 1988, supra; Singh and Reddy, 1989, Proc. Natl. Acad. Sci. USA
86:8280-8283). The snRNP-specific proteins have snRNP-specific functions in the
splicing reaction. In contrast, the only known function for the Sm proteins is in the
biogenesis of U snRNPs.

The biogenesis of snRNPs, which is illustrated in Figure 26 herein, is a 
complex, multistep process (DeRobertis, 1983, Cell 32:1021-1025; Fisher et al., 1985,
Cell 42:751-758; Mattaj, 1988, In: Structure and Function of Major and Minor Small
Nuclear Ribonucleoprotein Particles, pp. 100-114, Birnstiel, ed., Springer-Verlag,
Berlin; Feeney et al., 1989, J. Biol. Chem. 264:5776-5783; Neuman de Vegvar and
Dahlberg, 1990, Mol. Cell. Biol. 10:3365-3375; Zieve and Sauterer, 1990, Crit. Rev.
Biochem. Mol. Biol. 25:1-46). Spliceosomal snRNAs that contain the Sm site are first
exported to the cytoplasm, where they associate with the Sm proteins (B, B', D1, D2,
D3, E, F, and G) (Mattaj and DeRobertis, 1985, Cell 40:111-118). Next, in a reaction
that requires the assembled Sm core domain (comprising the Sm proteins bound to the
Sm site), the 7-methylguanosine (m7G) cap of the snRNAs is hypermethylated to yield
2,2,7-trimethylguanosine (m3G) (Mattaj, 1986, Cell 46:905-911). In addition, varying
numbers of nucleotides are trimmed from the 3' end of several of the snRNAs. Proper
Sm core assembly, cap hypermethylation, and 3'-end processing are important for
nuclear import of the assembled snRNP particles (Fischer and Lührmann, 1990,
Science 249:786-790; Hamm et al., 1990, Cell 62:569-577). Finally, just before or
after the nuclear import, many internal nucleotides are modified and more than 30
snRNP-specific proteins associate with the individual snRNP precursors to complete
their biogenesis (Mattaj, 1988, In: Structure and Function of Major and Minor Small
Nuclear Ribonucleoprotein Particles, pp. 100-114, Birnstiel, ed., Springer-Verlag,
Berlin; Lührmann et al., 1990, Biochim. Biophys. Acta Gene Struct. Express.
1087:265-292; Neuman de Vegvar and Dahlberg, 1990, Mol. Cell. Biol. 10:3365-3375;
Zieve and Sauterer, 1990, Crit. Rev. Biochem. Mol. Biol. 25:1-46). However, the
detailed mechanism of how the Sm core proteins and the snRNP-specific proteins form
functional assembled snRNPs is not clear.

There is, to date, no effective treatment for SMA and the mechanism underlying the 
disease process is poorly understood. Thus, there is an acute and long-felt need to 
understand the mechanism of the disease process and, more importantly, for the 
development of methods of treating this common and usually fatal disease. The
present invention addresses these needs.

SUMMARY OF THE INVENTION
The invention relates to an isolated nucleic acid encoding a eukaryotic 
SIP1, and any mutants, derivatives, variants, and fragments thereof.

In one aspect, the isolated nucleic acid shares at least about 20%
homology with at least one of huSIP1 (SEQ ID NO:1) and XeSIP1 (SEQ ID NO:3).

In another aspect, the isolated nucleic acid is selected from the group
consisting of SEQ ID NO:1 and SEQ ID NO:3.

The invention further relates to an isolated nucleic acid encoding a 
eukaryotic SIP1, wherein the SIP1 shares at least about 20% homology with at least
one of huSIP1 (SEQ ID NO:2) and XeSIP1 (SEQ ID NO:4), and any mutants,
derivatives, variants, and fragments thereof. 

Also included in the invention is an isolated polypeptide comprising a 
eukaryotic SIP1, and any mutants, derivatives, variants, and fragments thereof.

In one aspect, the SIP1 shares at least about 20% homology with at least
one of SEQ ID NO:2 and SEQ ID NO:4.

In another aspect, the amino acid sequence of the SIP1 is at least one of 
SEQ ID NO:2 and SEQ ID NO:4. 

In another aspect, the nucleic acid further comprises a nucleic acid 
encoding a tag polypeptide covalently linked thereto.

In one aspect, the tag polypeptide is selected from the group consisting 
of a myc tag polypeptide, a myc-pyruvate kinase tag polypeptide, a His6 tag 
polypeptide, an influenza virus hemagglutinin tag polypeptide, a maltose binding 
protein tag polypeptide, and a glutathione-S-transferase tag polypeptide.

In another aspect, the nucleic acid of the invention further comprises a
nucleic acid encoding a promoter/regulatory sequence operably linked thereto.

In yet another aspect, the nucleic acid further comprises a nucleic acid encoding a
tag polypeptide.

The invention also includes a cell comprising the nucleic acid of the
invention.

In one embodiment, the cell is a DT40 cell.

Also included is a vector comprising the isolated nucleic acid of the
invention.

In one aspect, the vector further comprises a nucleic acid encoding a 
promoter/regulatory sequence operably linked thereto.

Also included is a recombinant cell comprising the isolated nucleic acid
of the invention.

In one aspect, the recombinant cell comprises the aforementioned
vector.

In addition, the invention relates to an antisense isolated nucleic acid 
complementary to the nucleic acid of the invention.

Further, the invention relates to a cell comprising the antisense nucleic 
acid of the invention. 

In addition, there is included an antibody that specifically binds to a 
eukaryotic SIP1 polypeptide, or a fragment thereof.

The antibody may be selected from the group consisting of a polyclonal
antibody, a monoclonal antibody, and a synthetic antibody.

In a preferred embodiment, the antibody is a monoclonal antibody 
selected from the group consisting of 2S7 and 2E17. 

The invention also relates to an isolated nucleic acid encoding a 
mammalian Gemin3, and any mutants, derivatives, variants, and fragments thereof.

In one aspect, the nucleic acid shares at least about 20% homology with 
human Gemin3 (SEQ ID NO:7). 

In another aspect, the isolated nucleic acid is SEQ ID NO:7.

The invention further relates to an isolated nucleic acid encoding a
mammalian Gemin3, wherein the Gemin3 shares at least about 20% homology with
human Gemin3 (SEQ ID NO:8), and any mutants, derivatives, variants, and fragments
thereof. 

In addition, there is included an isolated polypeptide comprising a 
mammalian Gemin3, and any mutants, derivatives, variants, and fragments thereof.

In one aspect, the Gemin3 shares at least about 20% homology with
SEQ ID NO:8.

In another aspect, the Gemin3 is SEQ ID NO:8.

In another aspect, the nucleic acid further comprises a nucleic acid 
encoding a tag polypeptide covalently linked thereto. 

In one embodiment, the tag polypeptide is selected from the group 
consisting of a myc tag polypeptide, a myc-pyruvate kinase tag polypeptide, a His6 tag
polypeptide, an influenza virus hemagglutinin tag polypeptide, a maltose binding 
protein tag polypeptide, and a glutathione-S-transferase tag polypeptide. 

In another embodiment, the nucleic acid further comprises a nucleic 
acid encoding a promoter/regulatory sequence operably linked thereto.

Also included is a vector comprising the just-mentioned nucleic acid.

The vector may further comprise a nucleic acid encoding a 
promoter/regulatory sequence operably linked thereto. 

In addition, the invention includes a recombinant cell comprising the
just-mentioned nucleic acid.

The invention also includes a recombinant cell comprising the
just-mentioned vector.

The invention further includes an antisense isolated nucleic acid 
complementary to the just-mentioned nucleic acid, and a cell comprising the same. 

In addition, the invention relates to an antibody that specifically binds to
a mammalian Gemin3 polypeptide, or a fragment thereof.

In one aspect, the antibody is selected from the group consisting of a 
polyclonal antibody, a monoclonal antibody, and a synthetic antibody. 

In one embodiment, the antibody is a monoclonal antibody selected 
from the group consisting of 11G9 and 12H12.

Also included is an antibody that specifically binds to a eukaryotic
Survival of Motor Neurons (SMN) polypeptide, or a fragment thereof.

In one aspect, the SMN is human SMN and further wherein the antibody 
is monoclonal antibody 2B1. 

In another embodiment, the SMN is chicken SMN.

In addition, the invention includes an isolated nucleic acid encoding a
eukaryotic SIP1, and any mutants, derivatives, variants, and fragments thereof,
wherein the nucleic acid comprises a mutation that affects binding of SIP1 with SMN.

Further, the invention includes an isolated nucleic acid encoding human 
SMN, wherein the nucleic acid comprises a mutation which mutation affects binding of 
SMN with at least one of another SMN protein, a Gemin3 protein, and an SIP1 protein.

In addition, the invention includes an isolated nucleic acid encoding the
human SMN protein, wherein the nucleic acid comprises a mutation which mutation
affects pre-mRNA splicing. 

The invention further includes a cell comprising the just-mentioned
nucleic acid.

The invention also includes an isolated nucleic acid encoding a
eukaryotic SIP1, and any mutants, derivatives, variants, and fragments thereof, wherein
the nucleic acid comprises a mutation which mutation affects binding of SIP1 with 
SMN. 

The invention also includes a fusion protein comprising a tag 
polypeptide and at least a portion of an SMN protein.

In one aspect, the tag polypeptide is selected from the group consisting 
of a myc tag polypeptide, a myc-pyruvate kinase tag polypeptide, a His6 tag
polypeptide, an influenza virus hemagglutinin tag polypeptide, a maltose binding
protein tag polypeptide, and a glutathione-S-transferase tag polypeptide.

The invention further includes a fusion protein comprising a tag
polypeptide and at least a portion of an SIP1 protein.

In addition, the invention includes a fusion protein comprising a tag 
polypeptide and at least a portion of a Gemin3 protein. 

The invention also includes a vector comprising a nucleic acid encoding 
human SMN wherein the nucleic acid comprises a mutation and further wherein the
mutation affects SMN binding to at least one of another SMN protein, a Gemin3
protein, and an SIP1 protein. 

The invention also includes a composition comprising an isolated
purified SMN protein and a protein that binds specifically with SMN.

In one aspect, the protein that binds specifically with SMN is selected
from at least one of another SMN protein, an SIP1 protein, a Gemin3 protein, and an
Sm protein.

In one embodiment, the composition further comprises a ribonucleic
acid.

The invention further relates to a method of stimulating snRNP 
assembly. The method comprises contacting an extract comprising snRNP components
with SMN, thereby stimulating snRNP assembly. 

Also included is a mammalian cell comprising an exogenous SMN 
modulating sequence selected from the group consisting of a nucleic acid encoding 
SMN, an antisense nucleic acid complementary to a nucleic acid encoding SMN, and a 
ribozyme specific for ribonucleic acid encoding SMN, wherein the cell exhibits higher
or lower levels of SMN protein compared with an otherwise identical cell which does 
not comprise the exogenous SMN modulating sequence. 

In one embodiment, the exogenous SMN modulating sequence is an 
antisense nucleic acid and further wherein the cell exhibits a lower level of SMN 
protein compared with an otherwise identical cell which does not comprise the
antisense nucleic acid. 

In another embodiment, the cell further exhibits an altered growth 
characteristic compared with an otherwise identical cell which does not comprise the 
antisense nucleic acid. 

In yet another embodiment, the exogenous SMN modulating sequence
is a nucleic acid encoding SMN wherein the nucleic acid encoding SMN is covalently 
linked to a nucleic acid encoding a HA tag polypeptide, and further wherein expression 
of the exogenous SMN modulating sequence inhibits expression of endogenous SMN. 

Also included is a method of identifying a compound which affects the
level of SMN expression in a cell. The method comprises contacting the cell with a
test compound and comparing the level of SMN expression in the cell with the level of
SMN expression in an otherwise identical cell which is not contacted with the test
compound, wherein a higher or lower level of SMN expression in the cell contacted
with the compound compared with the level of SMN expression in the otherwise
identical cell which is not contacted with the compound is an indication that the
compound affects the level of SMN protein in the cell.

In one aspect, the compound increases the level of SMN expression in a
cell.

In another aspect, the cell is obtained from an SMA type I patient.

In a further aspect, the cell is selected from the group consisting of a
fibroblast and a lymphoblastoid cell.

In addition, the invention includes a method of identifying a test
compound which is a candidate SMA therapeutic. The method comprises contacting a 
cell with a test compound and comparing the level of SMN expression in the cell with 
the level of SMN expression in an otherwise identical cell which is not contacted with 
the test compound, wherein a lower level of SMN expression in the cell contacted with
the test compound compared with the level of SMN expression in the otherwise 
identical cell which is not contacted with the test compound is an indication that the 
test compound is a candidate SMA therapeutic, thereby identifying a compound which 
is a candidate SMA therapeutic.

In one aspect, the cell is obtained from an SMA type I patient.

In another aspect, the cell is selected from the group consisting of a 
fibroblast and a lymphoblastoid cell. 

The invention further includes a method of identifying a compound 
which affects the level of SMN expression in a cell comprising an SMN modulating 
sequence. The method comprises contacting the cell with a test compound and
comparing the level of SMN expression in the cell with the level of SMN expression in
an otherwise identical cell which is not contacted with the test compound, wherein a
higher or lower level of SMN expression in the cell contacted with the compound
compared with the level of SMN expression in the cell which is not contacted with the
compound is an indication that the compound affects the level of SMN expression in
the cell.

In one aspect, the SMN modulating sequence is selected from the group 
consisting of an isolated nucleic acid encoding SMN, an antisense nucleic acid 
complementary to a nucleic acid encoding SMN, and a ribozyme specific for 
ribonucleic acid encoding SMN.

In another aspect, the SMN modulating sequence is an antisense nucleic
acid complementary to a nucleic acid encoding SMN.

Also included in the invention is a method of identifying a compound 
useful for the treatment of SMA. The method comprises contacting a cell comprising 
an antisense nucleic acid complementary to a nucleic acid encoding SMN with a test 
compound and comparing the level of SMN expression in the cell with the level of
SMN expression in an otherwise identical cell which is not contacted with the test
compound, wherein a higher level of SMN expression in the cell contacted with the
compound compared with the level of SMN expression in the cell which is not
contacted with the compound is an indication that the compound is useful to treat
SMA, thereby identifying a compound useful for the treatment of SMA.

In addition, the invention relates to a method of assessing whether a test 
compound affects binding of SMN with a protein that specifically binds with SMN. 
The method comprises (a) making a first preparation comprising a surface having at 
least a portion of SMN bound thereon, the test compound, and a labeled protein that
specifically binds with SMN; (b) assessing the amount of the labeled protein bound
with the surface in the first preparation; and (c) comparing the amount of the labeled
protein bound with the surface in the first preparation and the amount of labeled
protein bound with the surface in an otherwise identical preparation to which the test
compound is not added, whereby a difference between the amount of labeled protein
bound with the surface in the first preparation and in the otherwise identical
preparation is an indication that the test compound affects the binding of SMN with a
protein that specifically binds with SMN. 

In one aspect, the protein that specifically binds with SMN is selected 
from the group consisting of another SMN protein, a SIP1 protein, a Gemin3 protein, a
SmB protein, a SmB' protein, a SmD1 protein, a SmD2 protein, and a SmD3 protein.

The invention further relates to a method of assessing whether a test 
compound is useful for treatment of SMA. The method comprises (a) making a first 
preparation comprising a surface having at least a portion of SMN bound thereon, the 
test compound, and a labeled protein that specifically binds with SMN; (b) assessing
the amount of the labeled protein bound with the surface in the first preparation; and
(c) comparing the amount of the labeled protein bound with the surface in the first
preparation and the amount of labeled protein bound with the surface in an otherwise
identical preparation to which the test compound is not added, whereby a lower amount
of the labeled protein bound with the surface in the first preparation than in the
otherwise identical preparation is an indication that the test compound is useful for
treatment of SMA.

In one aspect, the protein that specifically binds with SMN is selected 
from the group consisting of another SMN protein, a SIP1 protein, a Gemin3 protein, a 
SmB protein, a SmB' protein, a SmD1 protein, a SmD2 protein, and a SmD3 protein.

Also included is a method of enhancing splicing of mRNA. The method
comprises incubating an in vitro pre-mRNA processing extract in the presence of
SMN, or any mutant, derivative, variant, and fragment thereof, thereby enhancing 
splicing of the mRNA. 

In addition, the invention includes a method of identifying a compound 
that affects pre-mRNA splicing. The method comprises incubating an extract capable
of pre-mRNA splicing in the presence or absence of a test compound and comparing
the level of pre-mRNA splicing in the extract in the presence of the test compound with
the level of splicing of pre-mRNA in the absence of the test compound, wherein a
higher or a lower level of pre-mRNA splicing in the extract in the presence of the test
compound, compared with the level of pre-mRNA splicing in the extract in the absence
of the test compound, is an indication that the test compound affects pre-mRNA
splicing.

In addition, the invention relates to a method of identifying a test 
compound that is useful to treat SMA. The method comprises incubating an extract 
capable of pre-mRNA splicing in the presence or absence of a test compound and
comparing the level of pre-mRNA splicing in the extract in the presence of the test
compound with the level of splicing of pre-mRNA in the absence of the test compound, 
wherein a higher level of pre-mRNA splicing in the extract in the presence of the test 
compound, compared with the level of pre-mRNA splicing in the extract in the absence 
of the test compound, is an indication that the test compound is useful to treat SMA. 

The invention further relates to a method of identifying a compound that
affects snRNP assembly. The method comprises incubating an extract capable of
snRNP assembly in the presence or absence of a test compound and comparing the
level of snRNP assembly in the extract in the presence of the test compound with the
level of snRNP assembly in the absence of the test compound, wherein a higher or a
lower level of snRNP assembly in the extract in the presence of the test compound,
compared with the level of snRNP assembly in the extract in the absence of the test
compound, is an indication that the test compound affects snRNP assembly. 

In addition, there is provided a method of identifying a test compound 
that is useful to treat SMA. The method comprises incubating an extract capable of 
snRNP assembly in the presence or absence of a test compound and comparing the
level of snRNP assembly in the extract in the presence of the test compound with the
level of snRNP assembly in the absence of the test compound, wherein a higher level 
of snRNP assembly in the extract in the presence of the test compound, compared with 
the level of snRNP assembly in the extract in the absence of the test compound, is an 
indication that the test compound is useful to treat SMA. 

The invention further includes a method of assessing the presence or
degree of SMA in a mammal. The method comprises obtaining a biopsy comprising
motor neurons from the mammal and assessing the number and morphology of gems in
the motor neurons, wherein a lower number of gems in the motor neurons, compared
with the number of gems in motor neurons obtained from an otherwise identical
mammal which does not have SMA, is an indication that the mammal has SMA, and
further wherein the absence of or the presence of a minimal number of gems in the 
mammal having SMA is directly related to the severity of the SMA in the mammal. 

There is further provided in the invention a method of assessing the
presence or degree of SMA in a mammal. The method comprises comparing the level
of binding of SMN obtained from the mammal to a protein that specifically binds with
SMN with the level of binding of wild-type SMN to an identical protein that
specifically binds with SMN, wherein a lower level of binding of the SMN from the
mammal to the protein that specifically binds with SMN compared with the level of
binding of wild-type SMN with the identical protein that specifically binds with SMN
is an indication of the presence or degree of SMA in a mammal.

In one aspect, the protein that specifically binds with SMN is selected
from the group consisting of an SMN protein, an SIP1 protein, and a Gemin3 protein.

The invention additionally includes a knock-out targeting vector, the 
vector comprising a first nucleic acid portion encoding a sequence 5' of the open 
reading frame encoding SMN and a second nucleic acid portion encoding a nucleic 
acid sequence 3' of the open reading frame encoding SMN. 

In one aspect, the SMN is chicken SMN (SEQ ID NO:9). 

In another aspect, the vector further comprises a nucleic acid encoding a 
selectable marker covalently linked thereto. 

In one aspect, the first and second nucleic acid portions flank the nucleic 
acid encoding the selectable marker. 

Also included is a recombinant cell comprising the aforementioned 
knock-out targeting vector. 

The cell may further comprise a vector comprising an isolated nucleic
acid encoding SMN. 

In one embodiment, the cell is a chicken pre-B lymphoid DT40 cell.

In addition, the invention includes a method of identifying a compound
that affects SMN expression in a cell. The method comprises contacting the just- 
mentioned cell with a test compound and comparing the level of SMN expression in 
the cell with the level of SMN expression in an otherwise identical cell which is not 
contacted with the test compound, wherein a higher or lower level of SMN expression 
in the cell contacted with the test compound compared with the level of SMN 
expression in the otherwise identical cell which is not contacted with the compound is 
an indication that the compound affects SMN expression in a cell, thereby identifying a 
compound that affects SMN expression in a cell. 

There is also provided a method of identifying a compound that is 
useful to treat SMA. The method comprises contacting the aforementioned cell with a 
test compound and comparing the level of SMN expression in the cell with the level of 
SMN expression in an otherwise identical cell which is not contacted with the test 
compound, wherein a higher level of SMN expression in the cell contacted with the test 
compound compared with the level of SMN expression in the otherwise identical cell 
which is not contacted with the compound is an indication that the compound increases
SMN expression in a cell, thereby identifying a compound that is useful to treat SMA.

In addition, the invention includes a method of identifying a compound 
useful for the treatment of SMA. The method comprises contacting the 
aforementioned cell with a test compound and comparing the level of growth of the 
cell with the level of growth of an otherwise identical cell which is not contacted with 
the test compound, wherein a higher level of growth of the cell contacted with the 
compound compared with the level of growth of the cell which is not contacted with 
the compound is an indication that the compound is useful to treat SMA. 
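
The screening methods above share one decision rule: a readout (SMN expression level or cell growth) is measured in a cell contacted with the test compound and in an otherwise identical cell that is not, and the two values are compared. A minimal Python sketch of that comparison follows. The function names, the numeric readouts, and the `tolerance` parameter are illustrative assumptions only; the methods described here do not specify how large a difference must be.

```python
def classify_compound(treated, untreated, tolerance=0.0):
    """Compare a readout (e.g., SMN expression level or cell growth) between
    a treated cell and an otherwise identical untreated cell.

    `tolerance` is a hypothetical noise margin; the text sets no threshold.
    """
    difference = treated - untreated
    if difference > tolerance:
        return "increases"   # higher readout: candidate compound to treat SMA
    if difference < -tolerance:
        return "decreases"   # lower readout: affects the readout only
    return "no effect"


def affects_expression(treated, untreated, tolerance=0.0):
    """A compound affects SMN expression if the level is higher or lower."""
    return classify_compound(treated, untreated, tolerance) != "no effect"
```

For example, `classify_compound(1.8, 1.0)` reports an increase, which under the methods above would flag the compound as potentially useful to treat SMA.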

Also included in the invention is an isolated nucleic acid encoding a 
chicken SMN. 

In one aspect, the nucleic acid shares at least about 20% homology with 
SEQ ID NO:9. 

Further included is an isolated nucleic acid encoding chicken SMN, 
wherein the chicken SMN shares at least about 20% homology with SEQ ID NO:10. 

In addition, the invention includes an isolated polypeptide comprising 
chicken SMN. 

In one aspect, the SMN shares at least about 20% homology with SEQ 
ID NO:10. 

In another aspect, the SMN is SEQ ID NO:10. 

BRIEF DESCRIPTION OF THE DRAWINGS 
The file of this patent contains at least one drawing executed in color. 
Copies of this patent with color drawing(s) will be provided by the Patent and 
Trademark Office upon request and payment of the necessary fee. 

Figure 1 is a diagram depicting the amino acid sequence alignment of 
human SIP1 (huSIP1) (also referred to as Gemin2) (SEQ ID NO:2) and of Xenopus 
laevis SIP1 (XeSIP1) (SEQ ID NO:4) sequences. Also shown is the amino acid 
sequence alignment of SIP1 with the S. cerevisiae Brrl protein (SEQ ID NO:6). The 
boxes indicate identical amino acids, and the borderless gray boxes indicate similar 
amino acids. 

Figure 2A is an image of an SDS-PAGE gel depicting the interaction of 
SIP1 with SMN in vitro. SIP1 interacted with immobilized GST-SMN chimeric 
protein in vitro. The interaction was resistant to a 1 M salt (NaCl) wash. Under the 
same conditions, SIP1 did not bind with GST alone even at low salt (200 mM NaCl) 
concentration. 

Figure 2B is an image of a Western blot depicting the fact that SIP1 and 
SMN are associated with each other in vivo. Immunoprecipitation of total HeLa extract 
was accomplished using monoclonal antibodies 2B1 directed against SMN and 2S7 
directed against SIP1. The immunoprecipitated proteins were analyzed by Western 
blot using 2S7 and 2B1 (lane 2B1 IP and lane 2S7 IP, respectively). 

Figure 2C is an image of a Western blot depicting the formation of a 
complex by SIP1 and SMN of approximately 300 kDa or more in the cytoplasm. HeLa 
cytoplasmic S100 extract was fractionated on the basis of protein size on a G3000-SW 
column. The fractions were analyzed by SDS-PAGE, and the SMN and SIP1 proteins 
were detected by Western blot. 

Figure 3A is an image depicting the co-localization of SIP1 and SMN 
in gems. The image depicts a light microscopic image of an indirect 
immunofluorescence experiment on HeLa cells using monoclonal antibody 2S7 against 
the SIP1 protein. Note the general cytoplasmic staining and discrete nuclear structures. 

Figure 3B is an image depicting a contrast image of the same field as 
depicted in Figure 3A. 

Figure 3C is an image depicting superimposed laser confocal images of 
double-label immunofluorescence microscopy experiments using antibodies against 
SIP1 (green) and antibodies against coiled body marker p80-coilin (red). Co- 
localization of red and green results in a yellow color. 

Figure 3D is an image depicting superimposed laser confocal images of 
double-label immunofluorescence microscopy experiments using antibodies against 
SIP1 (green) and antibodies against coiled body marker, fibrillarin (red). Co- 
localization of red and green results in a yellow color. 

Figure 3E is an image depicting superimposed laser confocal images of 
double-label immunofluorescence microscopy experiments using antibodies against 
SIP1 (green) and anti-Sm antibody Y12 (red). Co-localization of green and red results 
in a yellow color. 

Figure 3F is an image depicting the superimposed confocal images of 
double-label immunofluorescence microscopy experiments using monoclonal antibody 
against SIP1 (green) and a rabbit polyclonal antiserum raised against exon 7 of the 
human SMN protein (red). Co-localization of green and red results in a yellow color. 

Figure 4A is an image of an SDS-PAGE gel depicting the 
coimmunoprecipitation of SMN- and SIP1-containing complexes with monoclonal 
antibodies against SMN and SIP1. The image demonstrates that the 
coimmunoprecipitation of SMN and SIP1 obtained from [35S]methionine-labeled HeLa 
total cell extract by monoclonal antibodies 2B1 (lane 2B1) and 2E17 (lane 2E17) 
resulted in a similar protein pattern. Control anti-Sm monoclonal antibody Y12 
immunoprecipitated Sm proteins and some snRNP-specific proteins (lane Y12), while 
negative control antibody SP2/0 illustrates the background of immunoprecipitation 
(lane SP2/0). 

Figure 4B is an image depicting a two-dimensional non-equilibrium pH 
gradient gel electrophoresis (NEPHGE) analysis of SMN-immunoprecipitated complex 
(2B1 panel) and the Sm-immunoprecipitated complexes containing core Sm proteins 
and some U snRNP-specific proteins from HeLa nuclear fractions immunoprecipitated 
with monoclonal antibody Y12 (Y12 panel). The dashed boxes indicate background 
proteins that were also detected in control SP2/0 immunoprecipitations. 

Figure 5A is an image of a gel depicting the fact that SMN protein 
bound directly with Sm B/B', D1-3, and E proteins. Purified snRNP proteins were 
analyzed on SDS-PAGE (lane designated "snRNPs") and were transferred to a 
nitrocellulose membrane. The membrane was then probed with 2 x 10^6 cpm of in vitro 
translated [35S]-labeled SMN protein. After washing away the nonspecific binding 
proteins, the membrane was exposed to an X-ray film (lane "[35S]SMN"). The three D 
Sm proteins could not be resolved using this gel system. The data disclosed herein 
demonstrate that SMN has two distinct binding domains for the Sm proteins and for 
SIP1, respectively. 

Figure 5B is an image of a gel depicting binding of Sm proteins with 
SMN in vitro. [35S]-labeled, in vitro-translated Sm proteins B, D1, D2, D3, E, F, and G 
were incubated for 30 minutes at 4°C with GST-SMN immobilized on glutathione- 
Sepharose beads. The beads were then washed six times with binding buffer as 
described elsewhere herein. Proteins that remained bound to the beads after washing 
were eluted by boiling the beads in SDS-PAGE sample buffer and the proteins were 
analyzed by SDS-PAGE followed by fluorography. 

Figure 5C is an image of a gel depicting the competition for SMN 
binding by two polypeptide fragments of SMN which were conjugated to BSA. In 
vitro translated [35S]-labeled Sm B and SIP1 proteins were mixed, and binding to GST- 
SMN immobilized on glutathione-Sepharose beads was performed as described in 
Figure 5B herein, either in the absence (-) or presence of BSA-coupled SMN peptide 
fragments corresponding to amino acids 13-44 (P1) or 240-267 (P2) of human SMN 
protein or to an unrelated control sequence from HIV-1 Rev NES (ctrl). After washing 
the beads, bound proteins were eluted by boiling the beads in SDS-PAGE sample 
buffer and the eluted proteins were analyzed by SDS-PAGE followed by fluorography. 
The data disclosed herein demonstrate that SMN has two distinct, non-overlapping 
binding domains for SIP1 (domain P1) and for the Sm proteins (domain P2), 
respectively, as depicted in the illustration at the bottom of this figure. 

Figure 6A is an image of a Western blot depicting the presence of SMN 
and SIP1 in the cytoplasm of Xenopus oocytes and the association of the proteins with 
spliceosomal snRNAs. Proteins obtained from total Xenopus somatic cells (XL-177 cell 
line) or from oocytes dissected into nucleus and cytoplasm (O. Nuc. and O. Cyto, 
respectively) were fractionated by SDS-PAGE and analyzed by Western blotting with 
anti-SMN antibody (2B1) or anti-SIP1 (2E17). 

Figure 6B is an image of a gel depicting the immunoprecipitation of 
spliceosomal U snRNAs using anti-SIP1 and anti-SMN antibodies. Selected mixtures 
of the indicated in vitro synthesized [32P]-labeled RNAs were injected into the 
cytoplasm of oocytes. Three hours later, immunoprecipitations were carried out using 
either anti-SMN antibody 2B1 or anti-SIP1 antibody 2E17. Immunoprecipitated RNA 
(IP) was analyzed by gel electrophoresis. The supernatants (SUP) of the 2B1 
immunoprecipitations are depicted; the supernatants of the 2E17 
immunoprecipitations were similar to those of 2B1. 

Figure 7 is an image of a gel depicting the fact that anti-SMN and anti- 
SIP1 antibodies immunoprecipitated U1 and U5 snRNAs obtained only from the 
cytoplasm of Xenopus oocytes. A mixture of [32P]-labeled U1 and U5 snRNAs was 
injected into the cytoplasm of oocytes. After incubation for 20 hours, the oocytes were 
dissected into nuclear (N) and cytoplasmic (C) fractions, as depicted in the illustration 
at the top of the figure. RNAs from both fractions were immunoprecipitated (IP) using 
either the anti-Sm antibody Y12, anti-SMN antibody 2B1, or anti-SIP1 antibody 2E17. 
One-fifth of the total supernatant (SUP) was loaded on the gel. 

Figure 8A is an image of a gel depicting the fact that anti-SIP1 
antibodies interfere with the biogenesis cycle of spliceosomal U snRNPs. Oocytes 
received a cytoplasmic injection of either water (-), anti-SMN antibody (2B1), anti-Sm 
antibody (Y12), or anti-SIP1 antibody (2E17). The same oocytes were injected in the 
nucleus 1 hour later with a mixture of [32P]-labeled U1, U5, and U6 snRNA. Fifteen 
hours later, the oocytes were dissected into nuclear (N) and cytoplasmic (C) fractions, 
and the injected RNAs were analyzed by electrophoresis on an RNA gel. 

Figure 8B is an image of a gel depicting inhibition of nuclear import of 
all spliceosomal snRNAs by anti-SIP1 antibodies. In vitro transcribed, [32P]-labeled 
snRNAs U1, U2, U4, and U5 were injected into the cytoplasm of oocytes either alone 
(-) or together with anti-SIP1 antibody (2E17). The oocytes were incubated for an 
additional 15 hours and they were then dissected into nuclear (N) and cytoplasmic (C) 
fractions. Injected [32P]-labeled RNAs were isolated and analyzed by electrophoresis 
on a denaturing RNA gel. 

Figure 9A is an image of a gel depicting the fact that anti-SIP1 and 
anti-SMN antibodies affected the assembly of the Sm core domain of spliceosomal 
snRNPs. Anti-SIP1 antibodies inhibited the assembly of the Sm core domain of all 
spliceosomal U snRNPs. Injections into the oocyte cytoplasm of either water (-), anti- 
Sm antibody (Y12), anti-SIP1 antibody (2E17), anti-SMN antibody (2B1), or control 
antibody (SP2/0) were performed. One hour later, the same oocytes were injected a 
second time in the cytoplasm with a mixture of [32P]-labeled U1, U5, and U6 snRNAs. 
In the experiment shown in the lane designated "SIP1 post", U1, U5, and U6 snRNAs 
were injected 1 hour prior to the injection of anti-SIP1 antibody. After incubation for 
an additional 1 hour, the oocytes were homogenized and the RNAs were 
immunoprecipitated with anti-Sm antibody Y12 (IP Y12). Immunoprecipitated RNAs 
were analyzed by electrophoresis on an RNA gel with the corresponding supernatants 
(SUP Y12). 

Figure 9B is an image of a gel depicting stimulation of the formation of 
the Sm core domain by the anti-SMN monoclonal antibody 2B1. The data disclosed 
herein demonstrate stimulation of Sm protein binding to U5 snRNA in the presence of 
high concentrations of the anti-SMN antibody 2B1. A mixture of [32P]-labeled U5 and 
U6 snRNAs was injected into the cytoplasm of oocytes either with water (-) or with 
anti-SMN and anti-SIP1 antibodies (2B1 and 2E17, respectively) at either high (h, 
3 µg/µl) or low (l, 1 µg/µl) concentrations. After a 1 hour incubation, the oocytes were 
homogenized, and the RNAs were immunoprecipitated with the anti-Sm antibody Y12 
(IP Y12). Immunoprecipitated RNAs were analyzed by electrophoresis on an RNA gel 
alongside one-tenth of the supernatants (SUP Y12). 

Figure 10A is an image depicting the fact that SMN interacts directly 
with itself and with SmB, and that these interactions are affected by mutations found in 
SMA patients. A binding assay of His-tagged SMN and SmB recombinant proteins (2 
mg) with either GST or GST-SMN was performed as described elsewhere herein. 
Bound SMN and SmB were analyzed by SDS-PAGE and Western blotting with an 
anti-T7 tag antibody. Ten percent of the input is shown in the first lane. 

Figure 10B is an image depicting the fact that SMN interacts directly 
with itself and with SmB, and that these interactions are affected by mutations found in 
SMA patients. In vitro translated [35S]methionine-labeled, myc-tagged SMN wild-type 
and mutant proteins were incubated with the indicated purified GST-fusions (SMN, 
SmB, or SIP1) as described elsewhere herein. Bound proteins were analyzed by SDS- 
PAGE and fluorography. Twenty percent of the input is depicted in the in vitro 
translation area. The area corresponding to GST-SmB binding is a 3-fold-longer 
exposure than the other areas. 

Figure 11A is an image depicting the fact that SMN self-association 
specifically increases the binding affinity for Sm proteins. GST or GST-SMN was 
preincubated with or without a 4-fold molar excess of His-SMN as described elsewhere 
herein. After washing away unbound His-SMN, in vitro translated [35S]methionine- 
labeled SMN, SmB, or SIP1 was added and the binding assay was performed as 
described elsewhere herein. Bound proteins were analyzed by SDS-PAGE and 
fluorography. Ten percent of the input is shown in the in vitro translation area. 

Figure 11B is an image depicting the fact that SMN self-association 
specifically increases the binding affinity for Sm proteins. GST-SMN was 
preincubated with or without a 4-fold molar excess of His-SMN as described above in 
Figure 11A. Then, in vitro translated [35S]methionine-labeled Sm proteins were 
added, and the binding assay was performed as described elsewhere herein. Twenty- 
five percent of the input is shown in the in vitro translation area. 

Figure 12A is an image depicting titration analysis of the effect of SMN 
self-association on SmB binding. GST-SMN was preincubated with the indicated 
molar excess of His-tagged SMNwt, SMNY272C, or SMNΔEx7. After washing away 
unbound recombinant proteins, in vitro translated [35S]methionine-labeled SmB was 
added and binding was performed as described elsewhere herein. Each binding assay 
was analyzed by Western blotting using anti-T7 tag mAb to detect bound His-tagged 
SMN proteins (SMN) and by autoradiography to detect bound SmB (lower band 
designated Sm B). 

Figure 12B is an image depicting titration analysis of the effect of SMN 
self-association on SmB binding. GST-SMNY272C was preincubated with the 
indicated molar excess of His-SMNY272C and processed further as described 
elsewhere herein. 

Figure 13A is an image depicting the fact that SMN mutants found in SMA 
patients are defective in oligomerization and interaction with Sm proteins. SMN wild 
type but not SMN mutants of SMA patients form oligomers. Purified recombinant 
His-tagged SMN, SMNY272C, SMNΔEx7, and SmB proteins were analyzed 
individually by HPLC gel filtration as described elsewhere herein. The fractions 
obtained therefrom were analyzed by SDS-PAGE, and the proteins were detected by 
Western blotting. The indicated positions of the molecular mass markers were 
determined by independent column chromatographies. 

Figure 13B is an image depicting the fact that SMN oligomers bind to 
SmB. The indicated mixtures of purified recombinant His-tagged SMN or SMNΔEx7 
with His-tagged SmB were analyzed by HPLC gel filtration. The fractions were 
analyzed by SDS-PAGE, and the proteins were detected by Western blotting. The 
indicated positions of the molecular mass markers were determined by independent 
column chromatographies. 

Figure 14 is an image depicting the reduced association of SMNY272C 
and SMNΔEx7 with Sm proteins in vivo. 293T cells were transiently transfected with 
the indicated myc-tagged SMN constructs and cell protein extracts were analyzed by 
coimmunoprecipitation. Briefly, total cell extracts were immunoprecipitated with 
either anti-Sm (Y12) or anti-SIP1 (2E17) monoclonal antibodies (mAbs). Total cell 
extracts (10% of the input) and the anti-Sm immunoprecipitates were analyzed by 
Western blotting with anti-myc (9E10) and Y12 antibodies; the anti-SIP1 
immunoprecipitates were analyzed by Western blotting with 9E10 and 2E17. A 
relatively low amount of antibody was used in the immunoprecipitations to improve 
the detection of SmB/B' and SIP1 over the light chains of the immunoglobulins, which 
migrate very closely. The transfected proteins were overexpressed approximately 5- to 
10-fold compared with endogenous SMN. Only SmB is depicted because other Sm 
proteins were not detected by Western blotting using Y12 antibody. The transfected 
SMNΔEx7 migrated closer to full-length SMN on a 12.5% polyacrylamide SDS-PAGE 
gel. 

Figure 15A is a diagram depicting the structure of the SMN protein and 
its interacting domain. The amino acid numbers and the borders of exons are 
indicated. The SIP1-interacting domain resides at the amino terminus of SMN as 
determined by competition experiments (see, e.g., Example 1). SMN self-association 
and SMN/Sm interaction domains overlap with the conserved YG box at the carboxyl- 
terminus of SMN as determined by deletion, mutation, and competition experiments. 

Figure 15B is a diagram depicting the interactions of SMN/SIP1 with 
Sm proteins. Monomeric SMN, associated with SIP1, which binds to SMN but not to 
itself, contains a low-affinity binding site for Sm proteins. SMN self-associates, 
forming at least a SMN/SIP1 tetrameric complex. In this oligomeric conformation, a 
binding site is formed with a much higher affinity for the Sm proteins. SMN mutations 
found in SMA patients result in a reduced ability of SMN to self-associate (Burghes, 
1997, Am. J. Hum. Genet. 61:9-15) and also map within the Sm-binding site itself 
(Brzustowicz et al., 1990, Nature 344:540-541), thus affecting the SMN interaction 
with Sm proteins. 

Figure 16A is an image depicting the redistribution of Sm proteins in 
cells expressing the SMNΔN27 deletion mutant. The image depicts immunostaining 
of HeLa cells transiently transfected with myc-SMN wt using anti-myc tag 
monoclonal antibody 9E10 (green). 

Figure 16B is an image depicting the immunostaining of HeLa cells 
transiently transfected with myc-SMN wt using the anti-Sm monoclonal antibody Y12 
(red). 

Figure 16C is an image depicting double-label confocal 
immunofluorescence combining the images depicted in Figures 16A (anti-myc [green]) 
and 16B (anti-Sm [Y12] red) of HeLa cells transiently transfected with myc-SMN wt. 
Co-localization of the green and red signals results in a yellow signal. The nuclear 
gems are indicated by arrows and the dashed line demarcates the nucleus. 

Figure 16D is an image depicting the redistribution of Sm proteins in 
cells expressing the SMNΔN27 deletion mutant. The image depicts immunostaining 
of HeLa cells transiently transfected with myc-SMNΔN27 using anti-myc tag 
monoclonal antibody 9E10 (green). 

Figure 16E is an image depicting the immunostaining of HeLa cells 
transiently transfected with SMNΔN27 using the anti-Sm monoclonal antibody Y12 
(red). 

Figure 16F is an image of double-label confocal immunofluorescence 
combining the images depicted in Figures 16D (anti-myc [green]) and 16E (anti-Sm 
[Y12] red) on HeLa cells transiently transfected with myc-SMNΔN27. Co-localization 
of the green and red signals results in a yellow signal. The cytoplasmic accumulations 
are indicated by arrows and the dashed line demarcates the nucleus. 

Figure 17A is an image depicting the distribution of SMNΔN27 in cells 
expressing the SMNΔN27 deletion mutant protein. This image depicts the 
immunostaining of HeLa cells transiently transfected with myc-SMNΔN27 using anti- 
myc tag monoclonal antibody 9E10 (green). 

Figure 17B is an image depicting the redistribution of U snRNAs in 
cells expressing the SMNΔN27 deletion mutant. This image depicts in situ 
hybridization demonstrating U2 snRNA localization using a U2-specific antisense 
oligo probe (red). 

Figure 17C is an image depicting the co-localization of SMNΔN27 and 
U2 snRNAs in cells expressing the SMNΔN27 deletion mutant protein. This image 
depicts double-label confocal immunofluorescence using anti-myc tag monoclonal 
antibody 9E10 (green) and U2-specific antisense oligo probe (red). Co-localization 
results in a yellow signal. The cytoplasmic accumulations are indicated by arrows and 
the dashed line demarcates the nucleus. 

Figure 17D is an image depicting the distribution of SMNΔN27 in cells 
expressing the SMNΔN27 deletion mutant protein. This image depicts the 
immunostaining of HeLa cells transiently transfected with myc-SMNΔN27 using an 
anti-myc tag rabbit affinity-purified polyclonal antibody against the 9E10 epitope 
(green). 

Figure 17E is an image depicting the cap hypermethylation of snRNAs 
in the cytoplasm of cells transiently transfected with SMNΔN27 deletion mutant. This 
image depicts the immunostaining of HeLa cells transiently transfected with myc- 
SMNΔN27 using anti-TMG cap monoclonal antibody K121 (red). 

Figure 17F is an image depicting the co-localization of SMNΔN27 and 
TMG capped snRNAs in cells expressing the SMNΔN27 deletion mutant protein. This 
image depicts double-label confocal immunofluorescence using an anti-myc tag rabbit 
affinity purified polyclonal antibody against the 9E10 epitope (green) and anti-TMG 
cap monoclonal antibody K121 (red). Co-localization of green and red signals results 
in a yellow signal. The cytoplasmic accumulations are indicated by arrows and the 
dashed line demarcates the nucleus. 

Figure 18A is an image depicting the distribution of SMN wild type in 
cells transiently transfected with myc-SMNwt. HeLa cells transiently transfected with 
mycSMNwt were immunostained using anti-myc tag monoclonal antibody 9E10 (green 
signal). 

Figure 18B is an image depicting the localization of p80-coilin in cells 
expressing mycSMNwt tag recombinant protein. This image depicts a HeLa cell 
transiently transfected with mycSMN immunostained using anti-p80 coilin rabbit 
polyserum R288 (red signal). 

Figure 18C is an image depicting double-label confocal 
immunofluorescence demonstrating co-localization of SMNwt and coilin in cells 
transiently transfected with mycSMN recombinant. HeLa cells transiently transfected 
with mycSMNwt were immunostained using anti-myc tag (Figure 18A) and anti-p80 
coilin (Figure 18B) and the two images were combined. Co-localization of red and 
green signals results in a yellow signal. The nuclear gems are indicated by arrows and 
the dashed line demarcates the nucleus. 

Figure 18D is an image depicting the distribution of SMNΔN27 
recombinant in cells transiently transfected with myc-SMNΔN27. HeLa cells 
transiently transfected with myc-SMNΔN27 were immunostained using anti-myc tag 
monoclonal antibody 9E10 (green signal). 

Figure 18E is an image depicting the localization of p80-coilin in cells 
expressing myc-SMNΔN27 tag recombinant protein. This image depicts a HeLa cell 
transiently transfected with mycSMNΔN27 immunostained using anti-p80 coilin rabbit 
polyserum R288 (red signal). 

Figure 18F is an image depicting double-label confocal 
immunofluorescence demonstrating co-localization of SMNΔN27 and coilin in cells 
transiently transfected with mycSMN recombinant. HeLa cells transiently transfected 
with mycSMNΔN27 were immunostained using anti-myc tag (Figure 18D) and anti- 
p80 coilin (Figure 18E) and the two images were combined. Co-localization of red and 
green signals results in a yellow signal. The cytoplasmic accumulations are indicated 
by arrows and the dashed line demarcates the nucleus. 

Figure 19A is an image of a gel depicting the effect of SMN wild type 
and SMNΔN27 recombinant proteins on pre-mRNA splicing in vitro. This image 
depicts in vitro splicing assays in the presence of SMN wild type and SMNΔN27 
recombinant proteins. [α-32P]-labeled chicken δ-crystallin pre-mRNA was incubated 
for 20 or 40 minutes at 30°C with 50 micrograms of HeLa cell nuclear extracts (13 
milligrams per milliliter) in the presence of buffer D (lanes 2 and 3), or 37.5 
micrograms per milliliter of recombinant SMN wt (lanes 4 and 5) or 37.5 micrograms 
per milliliter of SMNΔN27 (lanes 6 and 7). The RNA products were analyzed by gel 
electrophoresis. 

Figure 19B is an image of an RNA gel depicting in vitro splicing assays 
after pre-incubation with SMN wild type and SMNΔN27 deletion recombinant 
proteins. HeLa cell nuclear extracts were first incubated for 20 minutes at 30°C in the 
presence of buffer D (lanes 2 and 3), or 37.5 micrograms per milliliter of SMN wt 
(lanes 4 and 5) or 37.5 micrograms per milliliter of SMNΔN27 (lanes 6 and 7). [α-32P]- 
labeled chicken δ-crystallin pre-mRNAs were added and the reactions were incubated 
further at 30°C for the indicated additional times. RNA products were analyzed by gel 
electrophoresis. The structures of the splicing products are shown schematically on the 
right-hand side of the figure. 

Figure 20 is an image depicting a gel demonstrating the effect of SMN 
recombinant proteins on spliceosome formation in vitro. Splicing reactions were 
carried out using [α-32P]-labeled chicken δ-crystallin pre-mRNA without (lanes 1 and 2) or 
with (lanes 3-5) preincubation. Splicing complexes were fractionated using 
electrophoresis on a native polyacrylamide gel and the complexes were visualized 
using autoradiography as described elsewhere herein. 

Figure 21 is an image depicting a gel demonstrating the effect of pre- 
incubation with anti-SMN antibodies on pre-mRNA splicing in vitro. Two 
micrograms of the following purified antibodies were added to the splicing reaction 
mixture: Y12 (anti-Sm), anti-SMN monoclonal (2B1), anti-SMN exon 7 (αEx7), and 
control antibody (SP2/0). [α-32P]-labeled chicken δ-crystallin pre-mRNA was added to 
the splicing reaction mixture and the mixture was incubated for an additional 20 or 40 
minutes at 30°C. The RNA products were analyzed by gel electrophoresis. A 
schematic representation of the structure of each radiolabeled RNA product is depicted 
along the right-hand side of the figure. 

Figure 22 is an image depicting a gel demonstrating the effect of 
SMNwt, SMNΔN27, SMNY272C, and SMNΔEx7 recombinant proteins on in vitro 
splicing of adenovirus 2 major late pre-mRNA (Ad-2 ΔIVS). The splicing reaction 
mixture was preincubated using either buffer D (lane 2) or 20 micrograms per milliliter 
of recombinant protein as indicated in lanes 3-6. [α-32P]-labeled adenovirus 2 major 
late pre-mRNA was then added to the mixture and the samples were incubated for 40 
minutes at 30°C. The RNA products were analyzed by gel electrophoresis. A 
schematic representation of the structure of each radiolabeled RNA product is depicted 
along the right-hand side of the figure. The inhibition of pre-mRNA cleavage by 
SMNΔN27 was not as complete as that disclosed elsewhere herein (e.g., 
Figure 19A) because of the lower amount of recombinant protein used in this 
experiment. 

Figure 23 is a diagram depicting a model of the role of SMN in pre- 
mRNA splicing. Without wishing to be bound by any particular theory, this diagram 
depicts the nuclear function(s) of SMN, and of gems and coiled bodies, in recycling 
snRNPs after pre-mRNA splicing as discussed more fully elsewhere herein. 

Figure 24A is an image of a gel depicting immunoprecipitation of the 
SMN complex using anti-SMN monoclonal antibody 2B1 from [35S]methionine- 
labeled HeLa cell lysate. The immunoprecipitated proteins were analyzed by SDS- 
PAGE and autoradiography (24 hours exposure). Antibody 2B1 (lane 2B1) 
immunoprecipitated SMN, Gemin2, Sm proteins B, B', D1-3, E, F and G, and a group 
of proteins indicated as p175, p105, p97, p95, p60 and p50. The SP2/0 lane 
demonstrates the background of immunoprecipitation (lane SP2/0). 

Figure 24B is an image depicting a longer exposure (36 hours) of the 
bottom part of the gel depicted in Figure 24A. The longer exposure detects Sm 
proteins more clearly than the shorter 24 hour exposure. The position of the molecular 
weight markers is indicated on the left of SP2/0 lane in kilodaltons (kDa). 

Figure 25A is a diagram depicting sequencing of Gemin3 using 
nanoelectrospray tandem mass spectrometry. A selected portion of the spectrum of the 
unseparated in-gel tryptic digest of the p105 band is depicted. Peptide ions designated by 
"T" are autolysis products of trypsin and were identified by comparison with the 
spectrum acquired from the control sample. Other peptide ions observed in the 
spectrum were, in turn, isolated by the first mass analyzer of a triple quadrupole 
instrument and fragmented in the collision cell, and their tandem mass spectra were 
acquired. Upon searching a comprehensive protein sequence database using tandem 
mass spectrometry data, peptide ions designated with filled triangles were identified as 
tryptic peptides originating from PTB-associated splicing factor (PSF) (P23246). 
However, PSF was not detected in the SMN complex by either co- 
immunoprecipitation or direct binding to several components of the SMN complex. 
Peptide ions designated using asterisks (*) were identified as peptides from 
immunoglobulins used for immunoaffinity purification. Tandem mass spectra acquired 
from the peptide ions having m/z 552.9 and 622.5 did not identify any protein in the 
protein sequence database(s) searched. However, when the search was performed 
against a comprehensive database of expressed sequence tags (i.e., dbEST) the peptide 
sequence VLISTDLTSR from EST clone W65908 was identified as matching the 
tandem mass spectrum. After full length sequence had been obtained as described 
elsewhere herein, the tandem mass spectrum acquired from the peptide ion at m/z 
622.5 was matched to the peptide LNSSDPSLIGLK (SEQ ID NO: 9) present in the 
sequence of Gemin3. 

Figure 25B is a diagram depicting the tandem mass spectrum acquired
from the doubly charged peptide precursor ion having m/z 552.9. A continuous series of
fragment ions containing the C-terminus of the peptide (Yn-ions) (Roepstorff and
Fohlman, 1984, Biomed. Mass Spectrom. 11:601) was produced upon collisional
fragmentation of tryptic peptides. A short stretch of the peptide sequence was deduced
unambiguously by considering precise mass differences between adjacent Yn-ions
(designated in bold capital letters) observed in the part of the spectrum above the m/z of the
parent ion. Leucine and isoleucine residues have the same nominal mass, are
usually not distinguishable by mass spectrometry, and are therefore indicated as "L/I."
The determined portion of the peptide sequence was combined with the masses of the
corresponding Yn-ions and with the mass of the intact peptide into a peptide sequence tag
as described by Mann and Wilm (1994, Analytical Chemistry 66:4390-4399), which
was subsequently used to search protein and EST databases using the program
PeptideSearch. Once the database search produced a hit, the corresponding peptide
sequence was retrieved from the database and the masses of the ions from the N-terminal
fragment series (A- and B-ions) were used to verify the match. This protocol enabled
highly confident protein identification even though only a single peptide containing ten
amino acid residues was matched to the sequence of a single EST clone.
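
The reading of residues from mass differences between adjacent Yn-ions can be illustrated with a minimal sketch. It is not the PeptideSearch program and not the procedure actually used to produce Figure 25B. The residue masses are standard monoisotopic values that do not appear in this specification, and the function names (precursor_mz, y_ions, read_tag) and the matching tolerance are hypothetical. The peptide VLISTDLTSR is taken from the text above.

```python
# Monoisotopic residue masses in Da (standard values, assumed here;
# they are not given in the specification).
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056   # mass of H2O added to the sum of residues
PROTON = 1.00728   # mass of one charge-carrying proton


def precursor_mz(peptide, charge=2):
    """m/z of the intact peptide carrying `charge` protons."""
    mass = sum(RESIDUE_MASS[aa] for aa in peptide) + WATER
    return (mass + charge * PROTON) / charge


def y_ions(peptide):
    """Singly charged y-ion m/z values, from y1 (C-terminal residue) upward."""
    ions, running = [], WATER + PROTON
    for aa in reversed(peptide):
        running += RESIDUE_MASS[aa]
        ions.append(running)
    return ions


def read_tag(ion_mzs, tol=0.01):
    """Residues implied by differences between adjacent ions, N- to C-terminal.

    Leucine and isoleucine share one mass, so they are reported as "L/I".
    """
    ordered = sorted(ion_mzs)
    residues = []
    for low, high in zip(ordered, ordered[1:]):
        delta = high - low
        hits = [aa for aa, m in RESIDUE_MASS.items() if abs(m - delta) <= tol]
        residues.append("/".join(hits) if hits else "?")
    # Larger y-ions extend toward the N-terminus, so reverse for N-to-C order.
    return residues[::-1]


if __name__ == "__main__":
    peptide = "VLISTDLTSR"
    parent = precursor_mz(peptide, charge=2)
    # Use only the ions above the parent m/z, as in the figure description.
    upper = [mz for mz in y_ions(peptide) if mz > parent]
    print(round(parent, 2), read_tag(upper))
```

In this sketch the stretch read from the ions above the parent m/z, together with the masses of the flanking ions and of the intact peptide, would make up the kind of sequence tag described by Mann and Wilm.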

Figure 26 is a diagram depicting a schematic representation of the
modular structure of Gemin3 and the various domains of Gemin3. This diagram
illustrates that Gemin3 encodes a DEAD-box containing RNA helicase. Further, the
diagram discloses that Gemin3 contains seven helicase motifs (helicase motifs are
reviewed in De la Cruz et al., 1999, TIBS 24:192-198) designated I, Ia, II, III, IV, V
and VI, which are represented by black boxes with conserved amino acids represented
in white letters within the boxes. Amino acid residues designated by upper case letters
indicate highly conserved residues; lower case letters indicate less conserved residues. The
SMN interacting domain (i.e., amino acids 456 to 547) is boxed in grey.

Figure 27 is a diagram depicting the amino acid sequence alignment of
human Gemin3 (SEQ ID NO:8) and the human DEAD box ATP-dependent RNA
helicase eIF4A-II. The N-terminal half of Gemin3 contains a DEAD box RNA
helicase domain while the C-terminal half does not exhibit homology to any protein in
the database. Amino acids shaded in light grey indicate similar amino acids, and dark
grey indicates identical amino acids. The positions of the seven helicase motifs as well
as the SMN interacting domain are indicated.

Figure 28A is an image depicting a gel demonstrating that monoclonal
antibodies 11G9 and 12H12 are specific for Gemin3. Myc-tagged Gemin3, hnRNP A1
and SMN proteins were produced using a rabbit reticulocyte lysate in the presence of
[35S]methionine. The labeled proteins were immunoprecipitated using monoclonal
antibodies 11G9 and 12H12 and the immunoprecipitated material was analyzed using
SDS-PAGE followed by autoradiography. Ten percent (10%) of the total in vitro
translated proteins is depicted in the left panel.

Figure 28B is an image depicting immunoblotting using monoclonal
antibody 11G9 on purified 6His-Gemin2 and 6His-Gemin3. The position of the
molecular weight markers is indicated on the left side of the figure (in kDa).

Figure 28C is an image of a Western blot depicting immunoblotting
using monoclonal antibodies 12H12 and 11G9 on total HeLa cell protein extract. In
addition to the strong p105 signal, 12H12, but not 11G9, produced a weak signal by
binding to a protein of about 55 kDa on Western blots. The position of the molecular
weight markers is indicated on the left side of the figure (in kDa).

Figure 29A is an image depicting the localization of Gemin3 in HeLa
cells. This image depicts a laser confocal image of indirect immunofluorescence on
HeLa cells using monoclonal antibody 12H12 against the Gemin3 protein. The data
disclosed herein demonstrate general cytoplasmic staining, as well as staining of the
nucleoplasm and of discrete nuclear structures.

Figure 29B is an image depicting a DIC image of the same cell depicted 
in Figure 29A. The arrows indicate gems. 

Figure 29C is an image depicting the fact that Gemin3 co-localizes with
p80 coilin. This image depicts superimposed laser confocal images of double-label
immunofluorescence microscopy experiments using antibodies against the coiled body
marker, p80 coilin (green), and anti-Gemin3 antibody 11G9 (red). Co-localization of green and
red signals results in a yellow signal. Dashed lines demarcate the nucleus.

Figure 29D is an image depicting the fact that SMN co-localizes with
p80 coilin. This image depicts superimposed laser confocal images of double-label
immunofluorescence microscopy experiments using antibodies against the coiled body
marker, p80 coilin (green), and anti-SMN antibody 2B1 (red). Co-localization of green
and red signals results in a yellow signal. Dashed lines demarcate the nucleus.

Figure 29E is an image depicting the fact that SMN co-localizes with
Gemin3. This image depicts superimposed laser confocal images of double-label
immunofluorescence microscopy experiments using monoclonal anti-Gemin3 11G9
(red) and a rabbit affinity-purified antibody against Exon 7 of human SMN (green).
Co-localization of green and red signals results in a yellow signal. Dashed lines
demarcate the nucleus.

Figure 30A is an image depicting a gel demonstrating that Gemin3 is in
a complex with SMN, Gemin2 (also referred to as SIP1), and the spliceosomal Sm
proteins. This image depicts immunoprecipitations from [35S]methionine-labeled HeLa
cells using monoclonal antibodies specific to SMN (2B1), Gemin3 (11G9), and the
snRNP core Sm proteins (Y12). The immunoprecipitated proteins were analyzed using
SDS-PAGE followed by autoradiography (24 hours exposure). The
immunoprecipitations were performed in the presence of Empigen BB or Triton X-100
as indicated elsewhere herein. The identity of several pertinent proteins
immunoprecipitated is indicated on the right side of the figure.

Figure 30B is an image depicting a longer exposure (36 hours) of the
bottom portion of the gel depicting immunoprecipitation in the presence of Triton X-100
(i.e., Figure 30A). The longer exposure was performed to visualize the Sm proteins
more clearly.

Figure 30C is an image of a gel depicting the fact that Gemin3, SMN,
Gemin2 and the Sm proteins can be co-immunoprecipitated in vivo. This image
depicts that monoclonal antibodies against Gemin3 co-immunoprecipitate SMN and
Gemin2. Immunoprecipitation using total HeLa protein extract was performed using
monoclonal antibody 11G9 and the immunoprecipitated proteins were analyzed by
Western blot using 2B1 (anti-SMN) or using 2S7 (anti-Gemin2) antibodies.

Figure 30D is an image of a gel depicting the fact that monoclonal
antibodies against SMN, Gemin2, and the Sm proteins co-immunoprecipitate Gemin3.
Immunoprecipitation using total HeLa cell protein extract was performed using
monoclonal antibodies against SMN (lane 2B1 IP), Gemin2 (lane 2S7 IP), or the Sm
proteins (lane Y12 IP). The immunoprecipitated proteins (IP) were analyzed by
Western blot using the anti-Gemin3 monoclonal antibody 12H12. The position of the
molecular weight markers is indicated on the left side of the figure (in kDa). The
positions of the light chain (l.c.) and heavy chain (h.c.) of the antibodies used for
immunoprecipitation are indicated.

Figure 30E is an image depicting the fact that Gemin3, SMN, and
Gemin2 are found in a complex of about 800 kDa, or more, in the cytoplasm. HeLa
cytoplasmic S100 extract was fractionated using a Superose 6 HR 10/30 column. The
fractions were analyzed by SDS-PAGE, and the Gemin3, SMN, and Gemin2 proteins
were detected by Western blotting. The fraction number and the molecular weight
standards were as indicated.

Figure 31A is an image of a gel depicting the fact that Gemin3 interacts
directly with SMN in vitro. In vitro translated [35S]methionine labeled myc-SMN and
myc-Gemin2 proteins were incubated with purified GST-Gemin3 as described
elsewhere herein. Bound proteins were analyzed using SDS-PAGE followed by
fluorography. The in vitro translation panel depicts 2% of the total input.

Figure 31B is an image of a gel depicting that Gemin3 interacts with a
subset of the Sm proteins in vitro. In vitro translated [35S]methionine labeled myc-Sm
proteins B, D1, D2, D3, E, F, and G were incubated with purified GST-Gemin3 or
GST-SMN as described elsewhere herein. Bound proteins were analyzed using
SDS-PAGE followed by fluorography. The in vitro translation panel contains 2% of the
input.

Figure 31C is an image of a gel depicting the fact that Gemin3 interacts 
directly with SMN and SmB in vitro. Wild-type recombinant 6His-SMN or 6His-SmB 
proteins were incubated with purified GST-Gemin3, or with GST alone. The input 
lane depicts 10% of 6His-SMN and 6His-SmB. Bound proteins were analyzed by 
SDS-PAGE and Western blotting. 

Figure 31D is an image of a gel depicting the fact that mutations found
in SMA severely affect SMN interaction with Gemin3. In vitro translated
[35S]methionine labeled wild-type myc-SMN and the indicated mutant proteins were
incubated with purified GST-Gemin3 as described elsewhere herein. The proteins
bound to GST-Gemin3 were analyzed using SDS-PAGE followed by fluorography.
The in vitro translation panel shows 2% of the input.

Figure 31E is an image of a gel depicting the fact that SMN
oligomerization does not affect the Gemin3 interaction. In vitro translated
[35S]methionine labeled myc-Gemin3 or myc-SmB proteins were incubated with
purified GST or GST-SMN which had been pre-incubated or not with 6His-SMN
wild-type protein as described elsewhere herein. Bound myc-Gemin3 and myc-SmB
proteins were analyzed using SDS-PAGE followed by fluorography (top portion of the
figure). The in vitro translation panel contains 5% of the input. The position of the
molecular weight markers is indicated on the left (in kDa).

Figure 31F is an image of a Western blot depicting the amount of 6His- 
SMN which bound to GST or GST-SMN. The input lane depicts 10% of 6His-SMN. 
About 5% of the 6His-SMN input is bound to GST-SMN. 

Figure 32A is a schematic representation of the myc-Gemin3 wild type
and deletion mutants used in the binding assays. The black boxes represent the seven
helicase domains and the grey box represents the auxiliary C-terminal domain.

Figure 32B is an image of a gel depicting that the interaction of
Gemin3 with SMN is mediated by the non-conserved C-terminal domain of Gemin3.
In vitro translated [35S]methionine labeled wild-type and mutant myc-Gemin3 proteins
were incubated with purified GST-SMN or GST alone. Bound myc-Gemin3 proteins
were analyzed using SDS-PAGE followed by fluorography. The in vitro translation
panel shows 5% of the input.

Figure 33A is an image depicting a Western blot demonstrating that the
hemagglutinin-chicken SMN (HA-cSMN) fusion protein was expressed in transfected
DT40 cells in the absence of tetracycline but was not expressed in the
presence of 2 micrograms per milliliter of tetracycline.

Figure 33B is an image depicting a Western blot demonstrating the fact
that removal of tetracycline from the culture medium
caused expression of an HA-cSMN fusion protein, which is larger than wild type cSMN
because of the additional amino acid residues comprising the HA tag polypeptide, and
lack of expression of wild type chicken SMN (cSMN). More specifically, in the
presence of 2 micrograms per milliliter tetracycline, only wild type cSMN is detected
by Western blot analysis using anti-chicken SMN antibody. In the absence of
tetracycline (i.e., in lanes designated "0") only the higher molecular weight HA-cSMN
fusion protein was detected and cSMN was not detected.

Figure 34A is an image depicting a Western blot demonstrating that
inhibition of cSMN expression and induction of HA-cSMN expression are
dose-dependent upon the concentration of tetracycline present in the cell culture medium.
That is, decreasing concentration of tetracycline caused increased expression of
HA-cSMN and concomitant decreased expression of cSMN.

Figure 34B is an image depicting a Western blot demonstrating the fact
that expression of HA-cSMN increases upon removal of tetracycline and that
repression of cSMN expression upon removal of tetracycline is permanent. That is,
removal of tetracycline results in continued expression of HA-cSMN and permanent lack
of expression of cSMN (until cell death occurs since HA-cSMN does not provide
necessary SMN function to the cell).

Figure 35A is the amino acid sequence of chicken SMN (SEQ ID NO:10).

Figure 35B is the nucleic acid sequence of chicken SMN (SEQ ID NO:9).

Figure 36A-B is the polypeptide sequence and 36C-F is the nucleic 
acid sequence of human Gemin3 (SEQ ID NO: 7). 

Figure 37 is the nucleic acid sequence of human SIP1 (SEQ ID NO:1;
GenBank Accession No. AF027150).

Figure 38 is the nucleic acid sequence of frog SIP1 (SEQ ID NO:3;
GenBank Accession No. AF0271501).

DETAILED DESCRIPTION OF THE INVENTION 
The invention is based on the discovery of a novel gene (SEQ ID NO:1)
and a protein encoded thereby (SEQ ID NO:2), designated the Survival of Motor
Neuron-Interacting Protein 1 (SIP1), so called because the protein is tightly associated
with the SMA disease gene product, SMN. Further, the protein has been provisionally
designated Gemin2 since it is the second component of gems to be identified to date,
SMN being the first. In addition, the nucleic and amino acid sequences of frog and
yeast homologs of human SIP1 are also disclosed herein.

The invention further relates to the discovery that SMN and SIP1 form a
large, approximately 300 kDa, more preferably, an 800 kDa, complex in vivo which
includes other SMN-associated proteins such as the Sm proteins. The data provided
herein establish that these proteins (e.g., SMN and SIP1) play an important role in
mRNA biogenesis and that SMA is the first human disease identified involving a
defect in mRNA metabolism. Indeed, the data disclosed herein demonstrate that the
SMN proteins of SMA patients bind with themselves, with SIP1, or both, with less
affinity than does wild type SMN. These results further indicate a causal connection
between decreased association between SMN and SIP1 and SMA disease processes.
Additionally, the invention relates to the discovery of a novel gene (SEQ ID NO:7) and
protein encoded thereby (SEQ ID NO:8) which is the third protein component of gems
identified to date and has therefore been designated Gemin3. Further, the invention
relates to the discovery that Gemin3 also binds with SMN such that Gemin3 is
associated with the afore-mentioned complex involved in mRNA biogenesis.

The invention also relates to a novel cell line which lacks endogenous
SMN but which comprises an exogenous nucleic acid encoding chicken SMN (a novel
homolog of human SMN [SEQ ID NO:9] which is disclosed elsewhere herein) under
the control of an inducible promoter/regulatory sequence. This cell line is an important
tool for the study of SMA disease mechanism(s) and for the identification of novel
therapeutics useful for treatment of this deadly hereditary disease for which there is no
known cure.

Nucleic acids and proteins 

The present invention, therefore, provides a novel isolated nucleic acid
(SEQ ID NO:1) encoding the human SIP1 protein (huSIP1 [SEQ ID NO:2]). In
addition, the invention provides an isolated nucleic acid (SEQ ID NO:3) encoding the
Xenopus laevis SIP1 protein (XeSIP1 [SEQ ID NO:4]).

The invention includes isolated nucleic acids encoding a eukaryotic
SIP1 which is, preferably, at least about 20% homologous to at least one of huSIP1
(SEQ ID NO:1) and XeSIP1 (SEQ ID NO:3). More preferably, the isolated nucleic
acid encoding a eukaryotic SIP1 is at least about 25%, preferably, at least about 35%,
more preferably, at least about 45%, even more preferably, at least about 55%, more
preferably, at least about 65%, even more preferably, at least about 75%, yet more
preferably, at least about 85% homologous, more preferably, at least about 95% and
even more preferably, at least about 99% homologous to at least one of huSIP1 (SEQ
ID NO:1) and XeSIP1 (SEQ ID NO:3). More preferably, the isolated nucleic acid
encoding a eukaryotic SIP1 is human huSIP1 or frog XeSIP1. Most preferably, the
isolated nucleic acid encoding a eukaryotic SIP1 is SEQ ID NO:1 or SEQ ID NO:3.

Thus, the invention also includes an isolated nucleic acid encoding a
eukaryotic SIP1 where the nucleic acid encodes a protein which is, preferably,
at least about 20% homologous to the amino acid sequence of at least one of human
SIP1 (SEQ ID NO:2) and frog SIP1 (SEQ ID NO:4). More preferably, the isolated
nucleic acid encodes a eukaryotic SIP1 which is at least about 25%, more preferably,
at least about 35%, even more preferably, at least about 45%, more preferably, at least
about 55%, yet more preferably, at least about 65%, even more preferably, at least
about 75%, yet more preferably, at least about 85% homologous, and more preferably,
at least about 95%, and even more preferably, at least about 99% homologous to at
least one of human SIP1 (SEQ ID NO:2) and frog SIP1 (SEQ ID NO:4). More
preferably, the isolated nucleic acid encodes a eukaryotic SIP1 that is human SIP1 or
frog SIP1. Most preferably, the isolated nucleic acid encodes a eukaryotic SIP1 having
the amino acid sequence SEQ ID NO:2 or SEQ ID NO:4.

Thus, the invention should be construed to include nucleic acids which
encode human SIP1 (SEQ ID NO:2) and frog SIP1 (SEQ ID NO:4).
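
One way to make the percentage thresholds recited above concrete is sketched below, under the assumption that "percent homologous" is read as the percentage of aligned positions occupied by the same subunit in both sequences. That reading, the function name percent_identity, and the short toy sequences are illustrative assumptions only. The two sequences are taken to have been aligned already, with "-" marking a gap.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percentage of compared alignment columns holding identical subunits.

    Both inputs must come from the same pairwise alignment and therefore
    have equal length. Columns that are gaps in both sequences are ignored;
    a gap opposite a residue counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


if __name__ == "__main__":
    # Hypothetical ten-position sequences differing at two positions.
    print(percent_identity("ACGTACGTAC", "ACGTAGGTAA"))
```

Other conventions (for example, counting conservative substitutions as matches, or normalizing by the shorter sequence) would give different percentages, so the convention should be stated whenever a threshold is applied.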

The invention also includes a nucleic acid encoding a mammalian
Gemin3 protein. Preferably, the nucleic acid encoding a mammalian Gemin3 is at least
about 20% homologous to human Gemin3 (SEQ ID NO:7). More preferably, the
isolated nucleic acid encoding a mammalian Gemin3 is at least about 25%, preferably,
at least about 35%, more preferably, at least about 45%, even more preferably, at least
about 55%, more preferably, at least about 65%, even more preferably, at least about
75%, yet more preferably, at least about 85% homologous, more preferably, at least
about 95% and even more preferably, at least about 99% homologous to SEQ ID
NO:7. More preferably, the isolated nucleic acid encoding a mammalian Gemin3 is
human Gemin3. Most preferably, the isolated nucleic acid encoding a mammalian
Gemin3 is SEQ ID NO:7.

In another aspect, the present invention includes an isolated nucleic acid
encoding a mammalian Gemin3, or a fragment thereof, wherein the protein encoded by
the nucleic acid shares at least about 20% homology with the amino acid sequence of
SEQ ID NO:8. Preferably, the nucleic acid encodes a protein that is about 35%
homologous, more preferably 65% homologous, even more preferably 95%
homologous, and most preferably about 99% homologous to the human Gemin3
disclosed herein (SEQ ID NO:8). Even more preferably, the Gemin3 protein encoded
by the nucleic acid is SEQ ID NO:8.

The present invention includes a nucleic acid encoding chicken SMN, or
a fragment thereof, wherein the nucleic acid shares at least about 20% homology with
chicken SMN (SEQ ID NO:9). Preferably, the nucleic acid is about 35% homologous,
more preferably about 65% homologous, even more preferably 95% homologous, and
most preferably about 99% homologous to the chicken SMN disclosed herein (SEQ ID
NO:9). Even more preferably, the nucleic acid is SEQ ID NO:9.

In another aspect, the present invention includes an isolated nucleic acid
encoding chicken SMN, or a fragment thereof, wherein the protein encoded by the
nucleic acid shares at least about 80% homology with the amino acid sequence of SEQ
ID NO:10. Preferably, the nucleic acid encodes a protein that is about 85%
homologous, more preferably 85% homologous, even more preferably 95%
homologous, and most preferably about 99% homologous to the chicken SMN
disclosed herein (SEQ ID NO:10). Even more preferably, the chicken SMN protein
encoded by the nucleic acid is SEQ ID NO:10.

Thus, the invention should be construed to include nucleic acids which 
encode chicken SMN (SEQ ID NO: 10). 

The isolated nucleic acid of the invention should be construed to include
an RNA or a DNA sequence encoding an SIP1 protein of the invention, and any
modified forms thereof, including chemical modifications of the DNA or RNA which
render the nucleotide sequence more stable when it is cell free or when it is associated
with a cell. Similarly, the invention includes such modified forms of Gemin3 and
chicken SMN. Chemical modifications of nucleotides may also be used to enhance the
efficiency with which a nucleotide sequence is taken up by a cell or the efficiency with
which it is expressed in a cell. Any and all combinations of modifications of the
nucleotide sequences are contemplated in the present invention.

The present invention should not be construed as being limited solely to
the nucleic and amino acid sequences disclosed herein. Once armed with the present
invention, it is readily apparent to one skilled in the art that other nucleic acids
encoding SIP1, SMN, and Gemin3 proteins may be obtained by following the
procedures described herein in the experimental details section for the isolation of the
human, frog and yeast nucleic acids encoding SIP1 proteins, as well as those
procedures used to identify and isolate human Gemin3 and chicken SMN, and the
respective polypeptides encoded thereby (i.e., Gemin3 and chicken SMN), as disclosed
elsewhere herein.

Further, the invention should be construed to include naturally occurring
variants or recombinantly derived mutants of wild type SIP1 DNA sequences, which
variants or mutants render the protein encoded thereby either more, less, or just as
biologically active as the full-length wild type SIP1 of the invention. Any number of
procedures may be used for the generation of mutant, derivative or variant forms of
SIP1, Gemin3, and SMN, using recombinant DNA methodology well known in the art
such as, for example, that described in Sambrook et al. (1989, Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Laboratory Press, New York), Ausubel et al.
(1997, Current Protocols in Molecular Biology, Green & Wiley, New York), and
elsewhere herein.

As will be apparent from the experiments described herein, SIP1
comprises a binding domain which mediates SIP1 binding to SMN. The invention
includes a mutant of SIP1, wherein a mutation is introduced into the sequence of SIP1
cDNA such that the protein encoded thereby does not bind SMN or binds to a lesser
extent. Such a mutant is useful in the methods of the invention and for the study of the
role of SMN-SIP1 interaction in mRNA biogenesis. Preferably, a mutant SIP1 gene
which encodes an SIP1 protein comprising a deletion whereby amino acids from the
carboxyl terminus of the SIP1 protein have been removed is useful in studying the
association of SIP1 with SMN. Examples of such mutants (SIP1ΔC89 and
SIP1ΔC162) have been disclosed herein. However, the invention is not limited solely
to these mutants; rather, the invention encompasses other mutants, comprising deletion
and point mutations, which demonstrate altered binding to SMN or other SMN-SIP1
complex associated proteins (e.g., Gemin3 and various Sm proteins) and ribonucleic
acids. These mutants allow the functional mapping of various portions of the protein to
determine which portion(s) of the protein mediate which function(s) of the full-length
protein.

Likewise, the present invention includes an isolated nucleic acid
encoding SMN and mutants, homologs and variants thereof which exhibit altered
binding with another SMN molecule and/or decreased binding with the
SMN-associated protein SIP1. These proteins are important tools to determine which
portion(s) of the proteins mediate the biological effect(s) associated with the full-length
protein. Further, they are useful in elucidating the mechanisms of SMA and in the
identification of therapeutics for treatment of this disease.

The invention also includes selected mutants, homologs, and variants of
Gemin3. Several of the proteins exhibited unchanged binding with SMN despite
deletions of selected portions of the amino acid sequence (e.g., ΔN368C272Gemin3)
while other proteins demonstrated lack of binding with SMN (e.g., ΔC328Gemin3 and
ΔN548Gemin3). Such proteins are important tools in elucidating the effect of
naturally-occurring and/or artificially produced amino acid sequences in cell processes
and, more specifically, for assessing the function of various proteins, and fragments
thereof, in SMA.

The invention also includes a nucleic acid encoding SMN or SIP1, and
mutants, derivatives, variants, and fragments thereof, that may retain biological
activity. Such variants, i.e., analogs of SMN or SIP1, include proteins or polypeptides
which have been or may be modified using recombinant DNA technology such that the
protein or polypeptide possesses additional properties which enhance its suitability for
use in the methods described herein, for example, but not limited to, variants
conferring enhanced stability on the SMN-SIP1 complex, enhanced specific binding of
SIP1 to SMN, and the like.

Procedures for the introduction of amino acid changes in a protein or
polypeptide by altering the DNA sequence encoding the polypeptide are well known in
the art and are also described in Sambrook et al. (1989, supra) and Ausubel et al. (1997,
supra).

The invention includes a nucleic acid encoding a mammalian SMN,
SIP1, or Gemin3 wherein a nucleic acid encoding a tag polypeptide is covalently
linked thereto. That is, the invention encompasses a chimeric nucleic acid wherein a
nucleic acid sequence encoding a tag polypeptide is covalently linked to a nucleic acid
encoding at least one of human SIP1, frog SIP1, yeast Brr1, human Gemin3, and
chicken SMN. Such chimeric (i.e., fusion) tag polypeptides are well known in the art
and include, for instance, myc, myc-pyruvate kinase (myc-PK), His6, maltose binding
protein (MBP), and glutathione-S-transferase (GST). However, the invention should in
no way be construed to be limited to the nucleic acids encoding the above-listed tag
polypeptides. Rather, any nucleic acid sequence encoding a polypeptide which may
function in a manner substantially similar to these tag polypeptides should be construed
to be included in the present invention.

A nucleic acid encoding a protein of interest (e.g., SMN, SIP1, Gemin3,
and any mutant, derivative, variant, or fragment thereof) comprising a nucleic acid
encoding a tag polypeptide and a fusion protein produced therefrom can be used to
localize SMN, SIP1, and Gemin3 within a cell and to study expression, localization,
and role(s) of the tagged protein in a cell before, during, and/or after exposing the cell
to a test compound potentially useful for treatment of SMA. Further, addition of a tag
to a protein of interest (e.g., SMN, SIP1, Gemin3) facilitates isolation and
purification of the "tagged" protein such that the protein of interest can be easily
produced and purified.

Additionally, variants may be chemically linked to another protein. For
example, SMN was conjugated with bovine serum albumin (BSA) as described in
Fischer et al. (1995, Cell 82:475-483). However, the invention is not limited to
chemically linking SMN with BSA, but also includes cross-linking SMN, SIP1, and
Gemin3 proteins by chemical means to any suitable composition. Thus, it should be
appreciated that the use of BSA is described herein by way of example only.

In certain situations, it may be desirable to inhibit expression of SMN, SIP1, or
Gemin3 in a cell that would otherwise express the protein. Therefore, the invention
includes compositions useful for inhibition of expression of SMN, SIP1, and Gemin3.
One such composition of the invention features an isolated nucleic acid complementary to a
portion or all of a nucleic acid encoding an SMN, SIP1, or Gemin3 which is in an
antisense orientation with respect to transcription. Preferably, the antisense nucleic
acid useful for inhibition of SIP1 expression shares at least 20% homology with at least
one of SEQ ID NO:1, SEQ ID NO:3, and SEQ ID NO:5.

Similarly, an antisense nucleic acid useful for inhibition of Gemin3
expression shares at least about 20% homology with SEQ ID NO:7.

Likewise, an antisense nucleic acid useful for inhibition of chicken SMN
expression shares at least about 20% homology with SEQ ID NO:9.

The above-referenced antisense nucleic acids serve to inhibit the
expression, function, or both, of human SIP1, frog SIP1, human Gemin3, and chicken
SMN.
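
The notion of a nucleic acid "complementary to a portion or all of" a coding sequence can be illustrated with a minimal sketch that computes the reverse complement of a sense-strand DNA string, or of a chosen portion of it. The function names and the short input sequence are hypothetical and are not taken from any sequence disclosed herein. The sketch says nothing about which regions of a transcript make effective antisense targets.

```python
# Watson-Crick pairing for DNA; upper- and lower-case letters are both handled.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def antisense_dna(sense):
    """Reverse complement of a sense-strand DNA sequence, written 5' to 3'."""
    return sense.translate(_COMPLEMENT)[::-1]


def antisense_to_region(sense, start, end):
    """Antisense sequence complementary to sense[start:end] (0-based, end-exclusive)."""
    return antisense_dna(sense[start:end])


if __name__ == "__main__":
    toy_sense = "ATGGCCTTA"  # hypothetical sequence for illustration only
    print(antisense_dna(toy_sense))
    print(antisense_to_region(toy_sense, 3, 9))
```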

The invention also includes an isolated polypeptide comprising a
eukaryotic SIP1. Preferably, the isolated polypeptide comprising a eukaryotic SIP1 is
at least about 20% homologous to at least one of human SIP1 (SEQ ID NO:2) and frog
SIP1 (SEQ ID NO:4). More preferably, the isolated polypeptide comprising a
eukaryotic SIP1 is at least about 35%, more preferably, at least about 65%, even more
preferably, at least about 95%, and more preferably, at least about 99% homologous to
at least one of human SIP1 and frog SIP1. More preferably, the isolated polypeptide
comprising a eukaryotic SIP1 is at least one of human SIP1 and frog SIP1. Most
preferably, the isolated polypeptide comprising a eukaryotic SIP1 is at
least one of SEQ ID NO:2 and SEQ ID NO:4.

The invention also includes an isolated polypeptide comprising a
mammalian Gemin3. Preferably, the isolated polypeptide comprising a mammalian
Gemin3 is at least about 20% homologous to human Gemin3 (SEQ ID NO:8). More
preferably, the isolated polypeptide comprising a mammalian Gemin3 is at least about
35%, more preferably, at least about 65%, even more preferably, at least about 95%,
and more preferably, at least about 99% homologous to human Gemin3. More
preferably, the isolated polypeptide comprising a mammalian Gemin3 is human
Gemin3. Most preferably, the isolated polypeptide comprising a mammalian Gemin3
is SEQ ID NO:8.

The invention also includes an isolated polypeptide comprising a
chicken SMN. Preferably, the isolated polypeptide comprising a chicken SMN is at
least about 20% homologous to SEQ ID NO:10. More preferably, the isolated
polypeptide comprising a chicken SMN is at least about 35%, more preferably, at least
about 65%, even more preferably, at least about 95%, and more preferably, at least
about 99% homologous to chicken SMN. More preferably, the isolated polypeptide
is chicken SMN. Most preferably, the isolated polypeptide comprising a
chicken SMN is SEQ ID NO:10.

The present invention also provides for analogs of proteins or peptides
which comprise a eukaryotic SIP1, a mammalian Gemin3, and chicken SMN protein as
disclosed herein. Analogs may differ from naturally occurring proteins or peptides by
conservative amino acid sequence differences or by modifications which do not affect 
sequence, or by both. For example, conservative amino acid changes may be made, 
which although they alter the primary sequence of the protein or peptide, do not 
normally alter its function. Conservative amino acid substitutions typically include 
substitutions within the following groups: 

glycine, alanine; 

valine, isoleucine, leucine; 

aspartic acid, glutamic acid; 

asparagine, glutamine; 

serine, threonine; 

lysine, arginine; 

phenylalanine, tyrosine. 
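
By way of illustration only, the substitution groups listed above can be expressed as a simple lookup. The following Python sketch is not part of the disclosed methods; the function names and the example sequences are hypothetical. It merely reports, for two aligned peptide sequences of equal length, whether each difference falls within one of the listed groups.

```python
# Conservative substitution groups as listed in the text above,
# written with standard one-letter amino acid codes.
CONSERVATIVE_GROUPS = [
    {"G", "A"},       # glycine, alanine
    {"V", "I", "L"},  # valine, isoleucine, leucine
    {"D", "E"},       # aspartic acid, glutamic acid
    {"N", "Q"},       # asparagine, glutamine
    {"S", "T"},       # serine, threonine
    {"K", "R"},       # lysine, arginine
    {"F", "Y"},       # phenylalanine, tyrosine
]


def is_conservative(a, b):
    """Return True if residues a and b differ but share one of the groups."""
    a, b = a.upper(), b.upper()
    if a == b:
        return False  # identical residues are not a substitution
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


def classify_differences(reference, variant):
    """List (position, ref_residue, var_residue, conservative?) per difference.

    Positions are 1-based. The two sequences must already be aligned and of
    equal length; insertions and deletions are outside this sketch.
    """
    if len(reference) != len(variant):
        raise ValueError("sequences must be aligned and of equal length")
    differences = []
    for position, (r, v) in enumerate(zip(reference, variant), start=1):
        if r != v:
            differences.append((position, r, v, is_conservative(r, v)))
    return differences


# Hypothetical five-residue example (not taken from any SEQ ID NO).
print(classify_differences("MKVLS", "MRVLD"))
```

In this hypothetical example the lysine-to-arginine change is reported as conservative and the serine-to-aspartic acid change is not. Whether a given substitution actually preserves function must still be determined experimentally.
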
Modifications (which do not normally alter primary sequence) include in vivo, or in 
vitro, chemical derivatization of polypeptides, e.g., acetylation, or carboxylation. Also 
included are modifications of glycosylation, e.g., those made by modifying the 
glycosylation patterns of a polypeptide during its synthesis and processing or in further 
processing steps; e.g., by exposing the polypeptide to enzymes which affect 
glycosylation, e.g., mammalian glycosylating or deglycosylating enzymes. Also 
embraced are sequences which have phosphorylated amino acid residues, e.g., 
phosphotyrosine, phosphoserine, or phosphothreonine. 

Also included are polypeptides which have been modified using 
ordinary molecular biological techniques so as to improve their resistance to 
proteolytic degradation or to optimize solubility properties or to render them more 
suitable as a therapeutic agent. Analogs of such polypeptides include those containing 
residues other than naturally occurring L-amino acids, e.g., D-amino acids or non- 
naturally occurring synthetic amino acids. The peptides of the invention are not 
limited to products of any of the specific exemplary processes listed herein. 

The present invention should also be construed to encompass "mutants," 
"derivatives," and "variants" of the peptides of the invention (or of the DNA encoding 
the same) which mutants, derivatives and variants are SMN, SIP1, and Gemin3 
peptides which are altered in one or more amino acids (or, when referring to the 
nucleotide sequence encoding the same, are altered in one or more base pairs) such that 
the resulting peptide (or DNA) is not identical to the sequences recited herein, but has 
the same biological property as the peptides disclosed herein, in that the peptide has 
biological/biochemical properties of the SMN, SIP1, or Gemin3 peptide of the present 
invention. A biological property of a SMN protein should be construed to include, but 
not be limited to, the ability of the peptide to bind specifically and avidly with another 
SMN molecule and also to bind with SIP1, Gemin3, and various Sm proteins. 

Likewise, a biological property of the Gemin3 peptide of the invention 
is to bind specifically with SMN. 

A biological activity of the SIP1 of the invention is the ability to bind 
specifically with SMN. 

Similarly, a biological function of Gemin3 is the ability to specifically 
bind with SMN. In addition, a biological property of Gemin3 is the ability to function 
as a helicase as indicated by the presence of a DEAD motif conserved in RNA 
helicases, as reviewed in Staley and Guthrie (1998, Cell 92:315-326). 
Further, a biological activity of SMN, SIP1, and Gemin3 is to be able to participate in 
mRNA biosynthesis. 

Further, the invention should be construed to include naturally occurring 
variants or recombinantly derived mutants of SIP1 and Gemin3 sequences, which 
variants or mutants render the protein encoded thereby either more, less, or just as 
biologically active as the full-length proteins of the invention. 

The nucleic acids, and peptides encoded thereby, are useful tools for 
elucidating the function(s) of SMN, SIP1 and Gemin3 in a cell. Further, they are 
useful for localizing the nucleic acid, protein, or both, in a cell and for assessing the 
level of expression of the nucleic acid and/or protein under selected conditions 
including in response to therapeutic treatment. Further, nucleic and amino acids 
comprising eukaryotic SIP1, and mammalian Gemin3 are useful diagnostics which can 
be used, for example, to identify a compound that affects expression of the protein and 
is a candidate SMA therapeutic. 

In addition, the nucleic acids, the proteins encoded thereby, or both, can 
be administered to a mammal to increase or decrease expression of SIP1 or Gemin3 in 
the mammal. This can be therapeutic to the mammal if under- or over-expression of 
SIP1 or Gemin3 in the mammal mediates a disease or condition associated with altered 
expression of the protein compared with normal expression of SIP1 and/or Gemin3 in a 
healthy mammal. Additionally, the nucleic and amino acids of the invention can be 
used to produce recombinant cells which are useful tools for the study of SMA, the 
identification of novel SMA therapeutics, and for elucidating the cellular role(s) of 
SIP1 and Gemin3 in mRNA biosynthesis and other cell processes, among other things. 
Further, the nucleic and amino acids of the invention can be used diagnostically, either 
by assessing the level of gene expression or protein expression and the biological 
activity of the protein, to assess severity and prognosis of SMA. 

In other related aspects, the invention includes a nucleic acid encoding a 
eukaryotic SIP1 operably linked to a nucleic acid comprising a promoter/regulatory 
sequence such that the nucleic acid is preferably capable of directing expression of the 
protein encoded by the nucleic acid. 

Expression of SIP1 either alone or fused to a detectable tag polypeptide 
in cells which either do not normally express SIP1 or which do not express SIP1 
comprising a tag polypeptide, can be accomplished by operably linking the nucleic acid 
encoding SIP1 to a promoter/regulatory sequence which serves to drive expression of 
the protein, with or without a tag polypeptide, in cells in which the exogenous nucleic 
acid (i.e., transgene) is introduced. 

Many promoter/regulatory sequences useful for driving constitutive 
expression of a gene are available in the art and include, but are not limited to, for 
example, the cytomegalovirus immediate early promoter enhancer sequence, the SV40 
early promoter, both of which were used in the experiments disclosed herein, as well as 
the Rous sarcoma virus promoter, and the like. Moreover, inducible and tissue specific 
expression of the nucleic acid encoding SIP1 may be accomplished by placing the 
nucleic acid encoding SIP1, with or without a tag, under the control of an inducible or 
tissue specific promoter/regulatory sequence. Examples of tissue specific or inducible 
promoter/regulatory sequences which are useful for this purpose include, but are not 
limited to the MMTV LTR inducible promoter, and the SV40 late enhancer/promoter. 
In addition, promoters which are well known in the art which are induced in response 
to inducing agents such as metals, glucocorticoids, and the like, are also contemplated 
in the invention. Thus, it will be appreciated that the invention includes the use of any 
promoter/regulatory sequence, which is either known or unknown, and which is 
capable of driving expression of the desired protein encoded by a nucleic acid operably 
linked to the promoter/regulatory sequence. 

Similarly, the invention includes a nucleic acid encoding a mammalian 
Gemin3 operably linked to a nucleic acid comprising a promoter/regulatory sequence 
such that the nucleic acid is preferably capable of directing expression of the protein 
encoded by the nucleic acid. Further, the present invention encompasses a nucleic acid 
encoding a chicken SMN operably linked to a nucleic acid comprising a 
promoter/regulatory sequence. 

Expressing SIP1, Gemin3, or chicken SMN using a promoter/regulatory 
sequence allows the isolation of large amounts of recombinantly produced protein. 
Further, where the lack or decreased level of SIP1, Gemin3, or SMN expression causes 
a disease or condition associated with such expression, the expression of the protein 
driven by a promoter/regulatory sequence can provide useful therapeutics including, 
but not limited to, gene therapy whereby the protein is provided. 

Vectors and recombinant cells 

The invention also includes a vector comprising a nucleic acid encoding 
a eukaryotic SIP1, a mammalian Gemin3, or a chicken SMN. The incorporation of a 
desired nucleic acid into a vector and the choice of vectors is well-known in the art as 
described in, for example, Sambrook et al., supra, and Ausubel et al., supra. 

Further, the invention encompasses expression vectors and methods for the 
introduction of exogenous nucleic acid encoding SMN, SIP1, and Gemin3 into a cell 
with concomitant expression of the exogenous nucleic acid in the cell using such 
methods as those described in, for example, Sambrook et al. (1989, supra), and 
Ausubel et al. (1997, supra), and as disclosed elsewhere herein. 

Expression of chicken SMN from a plasmid under the control of an inducible 
promoter/regulatory sequence allows the selective expression of SMN in a cell lacking 
endogenous SMN expression. As more fully set forth below, this cell provides a stable 
genetic system for the study of the role(s) of SMN in SMA and for the identification of 
SMA therapeutics to treat the effects of the lack of SMN in the cell. 

Selection of any particular plasmid vector or other DNA vector is not a 
limiting factor in this invention and a wide plethora of vectors is well-known in the art. 
Further, it is well within the skill of the artisan to choose particular promoter/regulatory 
sequences and operably link those promoter/regulatory sequences to a DNA sequence 
encoding a desired polypeptide. Such technology is well known in the art and is 
described, for example, in Sambrook, supra, and Ausubel, supra. 

The invention also includes cells, viruses, proviruses, and the like, 
containing such vectors. Methods for producing cells comprising vectors and/or 
exogenous nucleic acids are well-known in the art. See, e.g., Sambrook et al., supra; 
Ausubel et al., supra. 

The nucleic acids encoding SIP1, Gemin3, or chicken SMN can be 
cloned into various plasmid vectors. However, the present invention should not be 
construed to be limited to plasmids or to any particular vector. Instead, the present 
invention should be construed to encompass a wide plethora of vectors which are 
readily available and/or well-known in the art. 

The invention also includes the expression of human SIP1 in non-human 
cells where it is not normally expressed or expression of SIP1-tagged fusion 
protein in cells where this fusion protein is not normally expressed. In a preferred 
embodiment, human SIP1, human SMN, and Gemin3 nucleic acid tagged with a 
nucleic acid expressing a tag polypeptide was used to transiently transfect a 
mammalian cell. Plasmid constructs containing SMN, SIP1, or Gemin3, or mutants, 
variants, derivatives and fragments thereof, were cloned into several vectors 
comprising a nucleic acid encoding a tag polypeptide (see, e.g., Nakielny and Dreyfuss, 
1996, J. Cell. Biol. 134:1365-1373). The plasmids were introduced into the cell using 
standard methods (e.g., calcium phosphate, electroporation, and the like). 

In another embodiment, human SIP1 or SMN was expressed in Xenopus laevis 
oocytes where the exogenous nucleic acid encoding the mammalian proteins was 
introduced into the oocytes by standard microinjection techniques. The present 
invention also encompasses expression of these exogenous nucleic acids in amphibian 
and other non-mammalian cells (e.g., yeast, insect, and avian cells) using methods well- 
known in the art such as those disclosed elsewhere herein. Thus, it is clear that the 
invention is not limited to any particular vector or to any particular method of 
introducing the exogenous nucleic acid encoding at least one of SMN, SIP1, and 
Gemin3 into a cell. 

Expression of proteins of interest (e.g., SMN, SIP1, and Gemin3) in a 
cell, especially when the protein comprises a tag polypeptide, allows localization of the 
nucleic acid and/or the protein expressed therefrom within the cell under selected 
conditions such that the function(s) of the protein in the cell can be studied and 
identified. 

One skilled in the art would appreciate, based upon the disclosure 
provided herein, that the invention also includes expression of SIP1, SMN, Gemin3, 
and the like, in prokaryotic cells (e.g., bacterial cells such as, for example, E. coli). 
Accordingly, the invention includes expression of the proteins of the invention in such 
cells as well. 

In another preferred embodiment, SMN and SIP1 nucleic acids which 
expressed a tag polypeptide were also expressed in an in vitro transcription-translation 
system as well as in various E. coli strains to produce SMN or SIP1 protein variants or 
fragments fused to a tag polypeptide. Thus, the invention encompasses the expression 
of SMN, SIP1, and Gemin3 in a cell-free system in addition to the expression of such 
proteins in a cell. 

However, the invention should not be construed as being limited to 
these plasmid vectors, bacterial strains, or to these tag polypeptides. Further, the 
invention is not limited to calcium phosphate transfection or to HeLa cells as 
exemplified herein. Instead, the invention encompasses other expression vectors and 
methods for the introduction of exogenous DNA into cells with concomitant expression 
of the exogenous DNA in the cells such as those described, for example, in Sambrook 
et al. (1989, supra), and Ausubel et al. (1997, supra). 

The invention also includes a cell comprising an isolated nucleic acid 
encoding at least one of an SIP1, and a Gemin3, or a vector comprising the same. 

The invention relates to a cell line as an in vitro model of SMA for the study of defects 
in SMN and its associated proteins in mRNA biogenesis and for the screening of 
compounds which affect SMN activity. The cell line of the invention is one which 
exhibits reduced expression of SMN when compared with normal levels of expression 
of SMN in cells or, alternatively, exhibits increased or altered expression of SMN 
when compared to the level of SMN expression in an otherwise identical cell which 
does not comprise the exogenous nucleic acid encoding SMN. 
Cell lines can be created which exhibit enhanced expression of SMN; however, the 
preferred cell line of the invention is one in which the expression of SMN is reduced 
when compared with normal expression of SMN. The cell line of the invention should 
not be construed to be limited by the manner in which the expression of SMN is 
reduced when compared with normal expression of SMN, in that there are any number 
of ways to reduce expression of SMN in a cell. 

In one embodiment, the cell line is a mammalian cell comprising an 
expression vector comprising the human SMN cDNA constitutively expressed under 
the control of a high-level expression promoter/regulatory sequence. The cells may be 
transfected with constructs which comprise SMN cDNA in either a sense (i.e., sense 
cells) or an antisense orientation (i.e., antisense cells). 

In another embodiment exemplified herein, the SMN modulating 
sequence was a plasmid wherein a sequence encoding a fusion protein comprising an 
HA tag polypeptide fused to the amino-terminus of chicken SMN (HA-cSMN) was 
expressed under a tetracycline repressible promoter. As the data disclosed herein 
demonstrate, removal of tetracycline allowed derepression of the promoter such that 
the HA-cSMN fusion was expressed in the cell (e.g., a chicken pre-B lymphoid DT40 
cell although other cell lines can be used). Without wishing to be bound by theory, 
expression of the HA-cSMN caused expression of endogenous cSMN to be inhibited 
such that only expression of HA-cSMN was detected in the cell. This apparent 
autoregulation of SMN expression, in turn, was lethal since the HA-cSMN did not 
replace endogenous wild type cSMN in the cell. Thus, one skilled in the art would 
appreciate, based upon the disclosure provided herein, that this cell, wherein a non- 
functional SMN replaces endogenous SMN, provides an important system analogous to 
a cell of an SMA patient in that the expression and, therefore, function of SMN are 
inhibited. Thus, this cell of the invention provides a useful system for the study of the 
role(s) of SMN in SMA and also for the identification of a compound useful for 
treatment of SMA which is characterized by the lack of endogenous expression of 
functional SMN. 

One skilled in the art would further appreciate that selected forms of 
SMN may be introduced to the cell of the invention in order to study the effect of any 
mutant, derivative, and variant of SMN (e.g., fusion proteins comprising at least a 
portion of SMN and a tag polypeptide) in this system. 

Additionally, in another embodiment, the invention relates to a cell line 
transfected with empty vector only (i.e., vector-only cells also referred to as "control 
cells"). 

Additionally, the use of ribozymes to effect a reduction in the 
expression of SMN in a cell line is contemplated, as is the use of any other means 
which would effect a reduction in expression of SMN in a cell line such as the use of 
knock-out and knock-in techniques as described elsewhere herein to affect SMN 
expression in a cell. 

One skilled in the art would appreciate that an antisense cell line, for 
example, serves as an in vitro model for SMA in that these cells contain reduced 
amounts of SMN which is analogous to the diseased motor neurons of SMA patients. 
Further, these cells can also demonstrate altered morphologies and growth 
characteristics which are also similar to naturally occurring SMA cell pathology. 
However, the invention is not limited to a cell line expressing lower levels of SMN 
protein. Indeed, the invention also discloses methods of producing cell lines which 
exhibit increased levels of SMN, i.e., cells transfected with vector encoding SMN in a 
sense orientation under the control of a constitutive promoter/regulatory sequence 
which drives expression of SMN protein at higher levels than that found in cells which 
are not transfected or which are transfected with an empty vector only. 

The invention is not limited to these cell lines or to any particular altered growth 
characteristics. Rather, other cell lines may be developed using the methods described 
herein to produce in vitro models of SMA. The identity of the cell line is not critical, 
except that the cell line must exhibit reduced expression of SMN protein which is 
analogous to the reduced level of expression exhibited by the cells of SMA patients. 
Alternatively, the invention encompasses production of a cell that exhibits an increased 
level of SMN protein expression compared with an otherwise identical cell which is 
not altered in any way. Examples of cell lines which may produce proteins encoded by 
a variety of expression vectors are found throughout the scientific literature. Also, the 
manner of providing the SMN modulating sequence to the cell is not critical except 
that, whether it is transcribed or translated, the nucleic acid must decrease or increase 
the levels of SMN available to participate in mRNA biogenesis compared with 
otherwise identical cells to which the SMN modulating sequence has not been 
provided. 

One skilled in the art will appreciate, based on the disclosure provided herein, that the 
level of SMN protein in a cell can be decreased by reducing or inhibiting expression of 
the SMN gene. Thus, the level of SMN protein in a cell can be decreased using a 
molecule or compound that inhibits or reduces gene expression such as, for example, 
antisense molecules or ribozymes. 

In a preferred embodiment, the SMN modulating sequence is a SMN 
antisense nucleic acid sequence which is expressed by a plasmid vector and used to 
transfect a mammalian cell thereby causing reduced endogenous expression of SMN 
protein in the cells. However, as stated previously herein, the invention should not be 
construed to be limited to inhibiting expression of SMN by transfection of cells with 
antisense molecules. Rather, the invention encompasses other methods known in the 
art for inhibiting expression of SMN protein in cells including, but not limited to, the 
use of ribozymes, and expression of a non-functional SMN under the control of, for 
example, an inducible promoter, and the like. 

In another preferred embodiment, the transfected cells exhibit altered 
growth characteristics compared with cells which are either not transfected or which 
are transfected with an empty plasmid vector or with a vector containing SMN in a 
sense orientation. 

The invention should not be construed as being limited to a cell 
exhibiting any particular altered growth characteristics or to a cell exhibiting any 
particular combination of altered characteristics. Rather, the invention includes other 
altered morphological or growth features and combinations thereof. 

Further, the invention includes a recombinant cell comprising an 
antisense nucleic acid which cell is a useful model for the study of SMA and/or other 
conditions associated with or mediated by inhibition of mRNA biosynthesis and for 
elucidating the role(s) of SIP1, SMN, and/or Gemin3 in such processes. That is, the 
lack of expression of SMN in SMA patients, among other things, indicated that SMN 
was involved in SMA. Accordingly, a recombinant (i.e., transgenic) cell comprising an 
antisense nucleic acid complementary to SMN is a useful tool for the study of the 
mechanism(s) of action of SMN and its role(s) in the cell and for the identification of 
therapeutics that ameliorate the effect(s) of decreased levels of SMN expression. 

One skilled in the art will appreciate that one way to decrease the levels 
of SMN mRNA and/or protein in a cell is to inhibit expression of the nucleic acid 
encoding the protein. Expression of SMN may be inhibited using, for example, 
antisense molecules, nucleic acids encoding non-functional SMN, and also by using 
ribozymes. 

Antisense molecules and their use for inhibiting gene expression are 
well known in the art (see, e.g., Cohen, 1989, In: Oligodeoxyribonucleotides, 
Antisense Inhibitors of Gene Expression, CRC Press). Antisense nucleic acids are 
DNA or RNA molecules that are complementary, as that term is defined elsewhere 
herein, to at least a portion of a specific mRNA molecule (Weintraub, 1990, Scientific 
American 262:40). In the cell, antisense nucleic acids hybridize to the corresponding 
mRNA, forming a double-stranded molecule thereby inhibiting the translation of 
genes. 
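
For illustration only, the complementarity relationship described above can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions: the function names are hypothetical, the example sequence is arbitrary and is not taken from any SMN sequence, and only canonical Watson-Crick pairing is considered. It derives the DNA antisense oligomer that would hybridize to a given mRNA segment.

```python
# Watson-Crick partners used to build a DNA antisense strand from an mRNA.
# Keys are RNA bases of the target; values are the pairing DNA bases.
DNA_PARTNER_OF_RNA_BASE = {"A": "T", "U": "A", "G": "C", "C": "G"}


def antisense_dna(mrna_segment):
    """Return the DNA antisense oligomer (written 5'->3') for an mRNA segment.

    The mRNA segment is given 5'->3'. Because hybridized strands are
    antiparallel, the antisense strand is the reverse complement.
    """
    segment = mrna_segment.upper()
    return "".join(DNA_PARTNER_OF_RNA_BASE[base] for base in reversed(segment))


def hybridizes_perfectly(mrna_segment, oligo):
    """True if the oligomer is the exact antisense strand of the segment."""
    return antisense_dna(mrna_segment) == oligo.upper()


# Arbitrary six-base example; not an SMN sequence.
print(antisense_dna("AUGGCC"))
```

A practical design would also weigh factors not modeled here, such as target accessibility and oligomer stability.
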

The use of antisense methods to inhibit the translation of genes is 
known in the art, and is described, for example, in Marcus-Sakura (1988, Anal. 
Biochem. 172:289). Such antisense molecules may be provided to the cell via genetic 
expression using DNA encoding the antisense molecule as taught by Inoue, 1993, U.S. 
Patent No. 5,190,931. 

Alternatively, antisense molecules of the invention may be made 
synthetically and then provided to the cell. Antisense oligomers of between about 10 
to about 30, and more preferably about 15 nucleotides, are preferred, since they are 
easily synthesized and introduced into a target cell. Synthetic antisense molecules 
contemplated by the invention include oligonucleotide derivatives known in the art 
which have improved biological activity compared to unmodified oligonucleotides (see 
Cohen, supra; Tullis, 1991, U.S. Patent No. 5,023,243, incorporated by reference 
herein in its entirety). 

Ribozymes and their use for inhibiting gene expression are also well 
known in the art (see, e.g., Cech et al., 1992, J. Biol. Chem. 267:17479-17482; Hampel 
et al., 1989, Biochemistry 28:4929-4933; Eckstein et al., International Publication No. 
WO 92/07065; Altman et al., U.S. Patent No. 5,168,053, incorporated by reference 
herein in its entirety). Ribozymes are RNA molecules possessing the ability to 
specifically cleave other single-stranded RNA in a manner analogous to DNA 
restriction endonucleases. Through the modification of nucleotide sequences encoding 
these RNAs, molecules can be engineered to recognize specific nucleotide sequences in 
an RNA molecule and cleave it (Cech, 1988, J. Amer. Med. Assn. 260:3030). A major 
advantage of this approach is that, because they are sequence-specific, only mRNAs 
with particular sequences are inactivated. 

There are two basic types of ribozymes, namely, tetrahymena-type 
(Hasselhoff, 1988, Nature 334:585) and hammerhead-type. Tetrahymena-type 
ribozymes recognize sequences which are four bases in length, while hammerhead-type 
ribozymes recognize base sequences 11-18 bases in length. The longer the sequence, 
the greater the likelihood that the sequence will occur exclusively in the target mRNA 
species. Consequently, hammerhead-type ribozymes are preferable to tetrahymena- 
type ribozymes for inactivating specific mRNA species, and 18-base recognition 
sequences are preferable to shorter recognition sequences which may occur randomly 
within various unrelated mRNA molecules. 
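
The preference for longer recognition sequences can be illustrated with a rough calculation. The Python sketch below is an illustration only and rests on a simplifying assumption that is not part of the disclosure: each of the four bases is taken to occur independently and with equal probability, so a given recognition sequence of n bases matches at any one position with probability (1/4)^n. Real transcripts depart from this assumption, and the function name and transcript length used here are hypothetical.

```python
def expected_random_sites(site_length, transcript_length):
    """Expected number of chance matches to one fixed recognition sequence.

    Assumes independent, equiprobable bases (a simplification), so each of
    the (transcript_length - site_length + 1) start positions matches with
    probability 0.25 ** site_length.
    """
    start_positions = max(transcript_length - site_length + 1, 0)
    return start_positions * 0.25 ** site_length


# Compare the recognition lengths named in the text for a hypothetical
# unrelated transcript of 1,000 bases.
for n in (4, 11, 18):
    print(n, expected_random_sites(n, 1000))
```

Under these assumptions a four-base site is expected to occur by chance several times in a 1,000-base transcript, whereas an 18-base site is expected to occur far less than once, which is consistent with the preference stated above.
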

Ribozymes useful for inhibiting the expression of SMN may be 
designed by incorporating target sequences into the basic ribozyme structure which are 
complementary to the mRNA sequence of the SMN encoded by SMN or having at least 
about 80% homology to chicken SMN (SEQ ID NO:9). Ribozymes targeting SMN 
may be synthesized using commercially available reagents (Applied Biosystems, Inc., 
Foster City, CA) or they may be genetically expressed from DNA encoding them. 

The invention further includes a recombinant cell comprising an isolated 
nucleic acid encoding SMN, SIP1, or Gemin3. The transgenic cell may be transiently 
transfected with a plasmid encoding a portion of the nucleic acid encoding the protein 
of interest, e.g., SMN. The nucleic acid need not be integrated into the cell genome 
nor does it need to be expressed in the cell. Moreover, the cell may be a prokaryotic or 
a eukaryotic cell and the invention should not be construed to be limited to any 
particular cell line or cell type. Such cells include, but are not limited to, Xenopus 
oocytes, chicken DT40 pre-B lymphoid cells, mammalian neuronal cells, E. coli, and 
the like. 

When the cell is a eukaryotic cell, the cell may be any eukaryotic cell 
which, when the isolated nucleic acid of the invention is introduced therein, and the 
protein encoded by the desired gene is no longer expressed therefrom, a benefit is 
obtained. Such a benefit may include the fact that there has been provided a system in 
which lack of expression of the desired gene can be studied in vitro in the laboratory or 
in a mammal in which the cell resides, a system wherein cells comprising the 
introduced gene deletion can be used as research, diagnostic and therapeutic tools, and 
a system wherein animal models are generated which are useful for the development of 
new diagnostic and therapeutic tools for selected disease states in a mammal including, 
for example, SMA. 

Alternatively, the invention includes a eukaryotic cell which, when the transgene of the 
invention is introduced therein, and the protein encoded by the desired gene is 
expressed therefrom where it was not previously present or expressed in the cell or 
where it is now expressed at a level or under circumstances different than that before 
the transgene was introduced, a benefit is obtained. Such a benefit may include the fact 
that there has been provided a system wherein the expression of the desired gene can 
be studied in vitro in the laboratory or in a mammal in which the cell resides, a system 
wherein cells comprising the introduced gene can be used as research, diagnostic and 
therapeutic tools, and a system wherein animal models are generated which are useful 
for the development of new diagnostic and therapeutic tools for selected disease states 
in a mammal (e.g., SMA). 

The present invention is not limited to a cell line wherein expression of 
SMN is affected by introduction of an SMN modulating sequence to the cell. That is, 
one skilled in the art would appreciate, based on the disclosure provided herein, that 
there are other methods of inhibiting or reducing expression of a nucleic acid encoding 
a protein of interest (e.g., SMN, SIP1, and Gemin3). Thus, the invention further 
includes a cell line wherein the endogenous nucleic acid encoding the protein of 
interest has been removed, either in part or in its entirety, from the cell chromosome. 
Further, the invention includes a cell line where the endogenous nucleic acid encoding 
a protein has been removed where the cell further comprises a vector comprising a 
nucleic acid encoding the protein such that the expression of the protein in such cell 
can be controlled. 

Therefore, the present invention includes a cell line wherein endogenous SMN 
expression has been ablated or significantly reduced and, instead, inducible SMN 
expression from a stably transfected plasmid has been provided to the cell. One 
skilled in the art would appreciate, based upon the disclosure provided herein, that this 
stable genetic system provides an important tool for the study of the effect of SMN 
expression in a cell, the role(s) of SMN and proteins associated therewith in cell 
processes, and for the identification of therapeutics useful for the treatment of SMA. 

One pf ordinary skill would appreciate, based upon the disclosure 

20 provided herein, that a targeting vector useful for inserting (/. e. , a "knock-in" targeting 
vector) or deleting (i.e., a "knock-out" targeting vector) nucleic acid sequences of 
interest in a cell comprises at least two sequences homologous to two portions of the 
nucleic acid which is to be deleted or replaced. In the case of the knock-out targeting 
vector, the two sequences are homologous with sequences that flank the gene; that is, 

25 one sequence is homologous with a region at or near the 5 f portion of the coding 

sequence of the nucleic acid encoding, for example, SMN, and the other sequence is 
further downstream from the first. One skilled in the art would appreciate, based upon 
the disclosure provided herein, that the present invention is not limited to any specific 
flanking nucleic acid sequences. Instead, the targeting vector may comprise two 

30 sequences which remove some or all (in the case of a "knock-in" vector) or which 
insert (in the case of a "knock-in" vector) a nucleic encoding SMN, or a fragment 



10873 v.2 



-54- 



thereof, into a mammalian genome. The crucial feature of the targeting vector is that it
comprises sufficient portions of two sequences located towards opposite, i.e., 5' and 3',
ends of the SMN open reading frame (ORF) to allow homologous
recombination to occur such that all or a portion of the nucleic acid encoding SMN is
deleted from a location on a mammalian chromosome. Alternatively, one of ordinary
skill in the art would appreciate that a knock-in targeting vector preferably comprises
sequences homologous to and flanking a location on a mammalian chromosome which
will be exchanged with the gene to be inserted (e.g., SMN, SIP1, and Gemin3). The
design of transgenes and knock-in and knock-out targeting vectors is well-known in the
art and is described in standard treatises such as Sambrook et al. (1989, Molecular
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, New York), and in
Ausubel et al. (1997, Current Protocols in Molecular Biology, John Wiley & Sons,
New York), and the like. The upstream and downstream portions flanking or within
the chicken SMN coding region, or sequences flanking a region to be replaced with
SMN by a knock-in vector, to be used in the targeting vector may be easily selected
based upon known methods and the disclosure provided herein, including the nucleic
and amino acid sequences of chicken
SMN. Armed with these sequences, one of ordinary skill in the art would be able to
construct the transgenes and knock-out vectors of the invention.
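The selection of flanking sequences described above can be illustrated with a minimal sketch. It assumes the locus is available as a plain string and that the region to be deleted is given by 0-based coordinates. The function names, the arm length, and the toy sequences are hypothetical and are not taken from the disclosure; real arm lengths and sequences would be chosen by the skilled artisan.

```python
from typing import Tuple


def select_homology_arms(locus: str, target_start: int, target_end: int,
                         arm_length: int) -> Tuple[str, str]:
    """Return the (5' arm, 3' arm) flanking locus[target_start:target_end].

    Hypothetical helper: coordinates are 0-based, end-exclusive.
    """
    if not (0 <= target_start < target_end <= len(locus)):
        raise ValueError("target region must lie within the locus")
    if arm_length <= 0:
        raise ValueError("arm length must be positive")
    if target_start < arm_length or len(locus) - target_end < arm_length:
        raise ValueError("not enough flanking sequence for the requested arms")
    five_prime_arm = locus[target_start - arm_length:target_start]
    three_prime_arm = locus[target_end:target_end + arm_length]
    return five_prime_arm, three_prime_arm


def assemble_knockout_cassette(five_prime_arm: str, marker: str,
                               three_prime_arm: str) -> str:
    """Place a selectable-marker sequence between the two homology arms."""
    return five_prime_arm + marker + three_prime_arm


# Toy example with placeholder sequences (not real SMN sequence).
toy_locus = "AAAACCCCGGGGTTTT"
arms = select_homology_arms(toy_locus, target_start=4, target_end=12, arm_length=4)
cassette = assemble_knockout_cassette(arms[0], "NNN", arms[1])
```

In this sketch a knock-in cassette would be assembled the same way, with the sequence to be inserted taking the place of the marker.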

The invention further includes a knock-out targeting vector comprising
a nucleic acid encoding a selectable marker such as, for example, a nucleic acid
encoding the neoR gene, thereby allowing the selection of a transgenic
(recombinant) cell where the nucleic acid encoding SMN, or a portion thereof, has been
deleted and replaced with the neomycin resistance gene, thereby conferring on the
recombinant cell the ability to grow in the presence of G418. However, the present
invention should not be construed to be limited to neomycin resistance as a selectable
marker. Rather, other selectable markers well-known in the art may be used in the
knock-out targeting vector to allow selection of transgenic cells where the SMN gene
has been deleted and/or inactivated and replaced by the nucleic acid encoding the
selectable marker of choice. Methods of selecting and incorporating a selectable
marker into a vector are well-known in the art and are described in, for example,
Sambrook et al. (1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor
Laboratory, New York), and in Ausubel et al. (1997, Current Protocols in Molecular
Biology, John Wiley & Sons, New York).
Antibodies 

The invention also includes an antibody that specifically binds SIP1, or
a fragment thereof. In one aspect, the antibody specifically binds a protein having the
amino acid sequence of at least one of SEQ ID NO:2, SEQ ID NO:4, and SEQ ID
NO:6, or a protein sharing about 20% homology with a protein having those sequences.
In one embodiment disclosed herein, the antibody is a murine monoclonal antibody
(e.g., 2S7, 2E17). However, the present invention is not limited to the particular
antibodies exemplified herein nor is the invention limited to monoclonal antibodies.
Rather, the invention encompasses polyclonal, monoclonal, synthetic antibodies, and
the like. One skilled in the art would understand, based upon the disclosure provided
herein, that the crucial feature of the antibody of the invention is that the antibody binds
specifically with SIP1 in a manner similar to those antibodies disclosed herein. More
specifically, the antibody of the invention recognizes SIP1 (e.g., human, Xenopus, and
mouse), or a fragment thereof, on Western blots and in immunostaining of cells, and
immunoprecipitates SIP1 using standard methods well-known in the art. Moreover,
selected antibodies of the invention can inhibit the mRNA splicing reaction and/or the
formation of the SMN-SIP1 complex in vivo and/or in vitro.

In addition, the invention includes an antibody that specifically binds
with Gemin3, or a fragment thereof. In one aspect, the antibody specifically binds a
protein having the amino acid sequence of SEQ ID NO:10. In another aspect, the
antibody specifically binds with a protein sharing about 20% homology with a protein
having the sequence SEQ ID NO:10, or a fragment thereof.

In one embodiment disclosed herein, the antibody is a murine
monoclonal antibody (e.g., 11G9, 12H12). However, the present invention is not
limited to the particular antibodies exemplified herein nor is the invention limited to
monoclonal antibodies. Rather, the invention encompasses polyclonal, monoclonal,
synthetic antibodies, and the like. One skilled in the art would understand, based upon
the disclosure provided herein, that the crucial feature of the antibody of the invention
is that it binds specifically with Gemin3 in a manner similar to the antibodies disclosed
herein. More specifically, the antibody of the invention recognizes Gemin3, or a
fragment thereof, on Western blots and in immunostaining of cells, and
immunoprecipitates Gemin3 using standard methods well-known in the art.

In addition, the invention includes an antibody that specifically binds with SMN, or a
fragment thereof. In one aspect, the antibody specifically binds a protein having the
amino acid sequence of SEQ ID NO:8. In another aspect, the antibody specifically
binds with a protein sharing about 20% homology with a protein having the sequence
SEQ ID NO:8, or a fragment thereof.

In one embodiment disclosed herein, the antibody is a murine
monoclonal antibody (e.g., 2B1) that specifically binds with chicken SMN. However,
the present invention is not limited to the particular antibody exemplified herein nor is
the invention limited to monoclonal antibodies. Rather, the invention encompasses
polyclonal, monoclonal, synthetic antibodies, and the like. One skilled in the art would
appreciate, based upon the disclosure provided herein, that the crucial feature of the
antibody of the invention is that it binds specifically with SMN in a manner similar to
those antibodies disclosed herein. More specifically, the antibody of the invention
recognizes human SMN, or a fragment thereof, on Western blots and in immunostaining of
cells, and immunoprecipitates SMN using standard methods well-known in the art.
Further, preincubation of a splicing extract with the antibody can also inhibit pre-mRNA
splicing as disclosed elsewhere herein.

In addition, the invention includes an antibody to chicken SMN. Such
antibody can be a polyclonal antibody, a monoclonal antibody, and the like, as
previously described elsewhere herein.

The antibodies of the invention are useful for the diagnosis, assessment
and treatment of SMA. These antibodies are also useful for elucidating the
mechanism(s) for pre-mRNA splicing and the causal relationship between such
processes and SMA. Further, the antibodies can be used to localize the relevant protein
in a cell and to study the role(s) of the antigen recognized thereby in cell processes.
Moreover, the antibodies can be used to detect and/or measure the amount of protein
present in a biological sample using well-known methods such as, but not limited to,
Western blotting and enzyme-linked immunosorbent assay (ELISA). Moreover, the
antibodies can be used to immunoprecipitate and/or immuno-affinity purify their
cognate antigen as described elsewhere herein.

The generation of polyclonal antibodies is accomplished by inoculating
the desired animal with the antigen and isolating antibodies which specifically bind the
antigen therefrom using standard antibody production methods such as those described
in, for example, Harlow et al. (1988, In: Antibodies, A Laboratory Manual, Cold
Spring Harbor, NY).

Monoclonal antibodies directed against full length or peptide fragments
of a protein or peptide can be prepared using any well known monoclonal antibody
preparation procedures, such as those described, for example, in Harlow et al., 1988,
supra, and in Tuszynski et al. (1988, Blood, 72:109-115), and methods set forth
elsewhere herein. Quantities of the desired peptide may also be synthesized using
chemical synthesis technology. Alternatively, DNA encoding the desired peptide may
be cloned and expressed from an appropriate promoter sequence in cells suitable for
the generation of large quantities of peptide. Monoclonal antibodies directed against
the peptide are generated from mice immunized with the peptide using standard
procedures as referenced herein.

Nucleic acid encoding the monoclonal antibody obtained using the
procedures described herein may be cloned and sequenced using technology which is
available in the art, and is described, for example, in Wright et al. (1992, Critical Rev.
Immunol. 12:125-168), and the references cited therein. Further, the antibody of the
invention may be "humanized" using the technology described in Wright et al. (supra),
and in the references cited therein, and in Gu et al. (1997, Thrombosis and Haemostasis
77:755-759).

To generate a phage antibody library, a cDNA library is first obtained
from mRNA which is isolated from cells, e.g., the hybridoma, which express the
desired protein to be expressed on the phage surface, e.g., the desired antibody. cDNA
copies of the mRNA are produced using reverse transcriptase. cDNA which specifies
immunoglobulin fragments is obtained by PCR and the resulting DNA is cloned into
a suitable bacteriophage vector to generate a bacteriophage DNA library comprising
DNA specifying immunoglobulin genes. The procedures for making a bacteriophage
library comprising heterologous DNA are well known in the art and are described, for
example, in Sambrook et al., supra.

Bacteriophage which encode the desired antibody may be engineered
such that the protein is displayed on the surface thereof in such a manner that it is
available for binding to its corresponding binding protein, e.g., the antigen against
which the antibody is directed. Thus, when bacteriophage which express a specific
antibody are incubated in the presence of a cell which expresses the corresponding
antigen, the bacteriophage will bind to the cell. Bacteriophage which do not express
the antibody will not bind to the cell. Such panning techniques are well known in the
art and are described, for example, in Wright et al. (supra).

Processes such as those described above have been developed for the
production of human antibodies using M13 bacteriophage display (Burton et al., 1994,
Adv. Immunol. 57:191-280). Essentially, a cDNA library is generated from mRNA
obtained from a population of antibody-producing cells. The mRNA encodes
rearranged immunoglobulin genes and thus, the cDNA encodes the same. Amplified
cDNA is cloned into M13 expression vectors creating a library of phage which express
human Fab fragments on their surface. Phage which display the antibody of interest
are selected by antigen binding and are propagated in bacteria to produce soluble
human Fab immunoglobulin. Thus, in contrast to conventional monoclonal antibody
synthesis, this procedure immortalizes DNA encoding human immunoglobulin rather
than cells which express human immunoglobulin.

The procedures just presented describe the generation of phage which
encode the Fab portion of an antibody molecule. However, the invention should not be
construed to be limited solely to the generation of phage encoding Fab antibodies.
Rather, phage which encode single chain antibodies (scFv/phage antibody libraries) are
also included in the invention. Fab molecules comprise the entire Ig light chain, that is,
they comprise both the variable and constant region of the light chain, but include only
the variable region and first constant region domain (CH1) of the heavy chain. Single
chain antibody molecules comprise a single chain of protein comprising the Ig Fv
fragment. An Ig Fv fragment includes only the variable regions of the heavy and light
chains of the antibody, having no constant region contained therein. Phage libraries
comprising scFv DNA may be generated following the procedures described in Marks
et al. (1991, J. Mol. Biol. 222:581-597). Panning of phage so generated for the
isolation of a desired antibody is conducted in a manner similar to that described for
phage libraries comprising Fab DNA.

The invention should also be construed to include synthetic phage 
display libraries in which the heavy and light chain variable regions may be 
synthesized such that they include nearly all possible specificities (Barbas, 1995, 
Nature Medicine 1:837-839; de Kruif et al. 1995, J. Mol. Biol. 248:97-105). 
Compositions

The invention includes a composition comprising an isolated purified
polypeptide comprising a eukaryotic SIP1 (e.g., human SIP1, frog SIP1, and yeast
homolog Brr1). Preferably, the composition comprises a pharmaceutically acceptable
carrier.

The invention includes a composition comprising an isolated purified
SMN and a protein that specifically binds SMN (e.g., another SMN, SIP1, Gemin3,
and various Sm proteins). The composition comprises the afore-mentioned proteins in
any combination or permutation thereof. In one aspect, the composition further
comprises a ribonucleic acid, such as, but not limited to, snRNA. The composition, and
the various permutations thereof, are an important tool for elucidating the precise
structure of the mRNA splicing complex and the mechanism(s) involved in mRNA
biosynthesis.

Also included in the invention is a composition comprising an antibody
that specifically binds SMN. Preferably, the composition comprises a
pharmaceutically-acceptable carrier. Such an antibody can be administered to an SMA
patient, since the data disclosed herein demonstrate that addition of anti-SMN mAb 2B1
increases aggregation of SMN. Such aggregation is deficient in SMA-afflicted humans,
since the SMN of SMA patients exhibits a decreased ability to aggregate with itself and with
SIP1, indicating that such aggregation/binding is important and that compounds that
increase such protein interactions should ameliorate or treat SMA by increasing such
important interaction(s).

The invention further includes a composition comprising an isolated and
purified SIP1 wherein the composition further comprises a pharmaceutically-acceptable
carrier. Such a composition can be used to immunize a mammal in order to
generate antibodies that specifically bind SIP1. Further, the composition can be used
to administer SIP1 to a mammal whereby a condition mediated by or associated with a
decreased amount of SIP1 is ameliorated or treated by the administration of the protein
to the mammal.

The invention includes a composition comprising an isolated purified
SMN and an isolated purified SIP1. Such a novel composition, which is disclosed
elsewhere herein, is a useful tool for the study of the interaction between SMN and SIP1,
which is deficient in SMA patients, thereby indicating that such protein-protein
interaction is important in SMA. Further, the data disclosed herein demonstrate that
SMN and SIP1 associate in vivo as part of a large molecular weight complex of about
800 kDa, which complex is involved in mRNA biosynthesis. Thus, elucidation of the
nature of the interactions between these proteins is important in the study of mRNA
biosynthesis. In addition, the composition is useful in molecular protein modeling
assays (e.g., crystallography, biosensor analyses, and the like) to determine the
precise configuration of the complex, to further characterize the association kinetics of
the complex, and to aid in the elucidation of the precise mechanism(s) involved in
mRNA biosynthesis.

The invention also includes a composition comprising an isolated
purified SMN and an isolated purified Gemin3. The data disclosed herein demonstrate,
for the first time, that SMN and Gemin3 associate in vivo as part of a high molecular
weight complex. Thus, the study of the interaction between SMN and Gemin3 is
important in elucidating SMA disease processes and in studying the role(s) and
interaction(s) of these proteins with respect to mRNA biosynthesis. In addition, the
composition is useful in molecular protein characterization assays (e.g.,
crystallography, biosensor analyses, and the like) to determine the precise
configuration of the proteins within the complex, to further characterize the
association kinetics of the proteins, and to aid in the elucidation of the precise
mechanism(s) involved in mRNA biosynthesis.

For administration of the above-mentioned compositions to a mammal, a
polypeptide, or the nucleic acid encoding it, or both, can be suspended in any
pharmaceutically acceptable carrier, for example, HEPES buffered saline at a pH of
about 7.8. Other pharmaceutically acceptable carriers which are useful include, but are
not limited to, glycerol, water, saline, ethanol and other pharmaceutically acceptable
salt solutions such as phosphates and salts of organic acids. Examples of these and
other pharmaceutically acceptable carriers are described in Remington's
Pharmaceutical Sciences (1991, Mack Publication Co., New Jersey).

The pharmaceutical compositions may be prepared, packaged, or sold in
the form of a sterile injectable aqueous or oily suspension or solution. This suspension
or solution may be formulated according to the known art, and may comprise, in
addition to the active ingredient, additional ingredients such as the dispersing agents,
wetting agents, or suspending agents described herein. Such sterile injectable
formulations may be prepared using a non-toxic parenterally-acceptable diluent or
solvent, such as water or 1,3-butane diol, for example. Other acceptable diluents and
solvents include, but are not limited to, Ringer's solution, isotonic sodium chloride
solution, and fixed oils such as synthetic mono- or di-glycerides.

Pharmaceutical compositions that are useful in the methods of the
invention may be administered, prepared, packaged, and/or sold in formulations
suitable for oral, rectal, vaginal, parenteral, topical, pulmonary, intranasal, buccal,
ophthalmic, or another route of administration. Other contemplated formulations
include projected nanoparticles, liposomal preparations, resealed erythrocytes
containing the active ingredient, and immunologically-based formulations.

The compositions of the invention may be administered via numerous
routes, including, but not limited to, oral, rectal, vaginal, parenteral, topical,
pulmonary, intranasal, buccal, or ophthalmic administration routes. The route(s) of
administration will be readily apparent to the skilled artisan and will depend upon any
number of factors including the type and severity of the disease being treated, the type
and age of the veterinary or human patient being treated, and the like.

Pharmaceutical compositions that are useful in the methods of the
invention may be administered systemically in oral solid formulations, ophthalmic,
suppository, aerosol, topical or other similar formulations. In addition to the
compound such as heparan sulfate, or a biological equivalent thereof, such
pharmaceutical compositions may contain pharmaceutically-acceptable carriers and
other ingredients known to enhance and facilitate drug administration. Other possible
formulations, such as nanoparticles, liposomes, resealed erythrocytes, and
immunologically based systems may also be used to administer SMN, alone or in
combination with at least one protein specifically associated with SMN (e.g., another
SMN, SIP1, Gemin3, and Sm proteins), and/or a nucleic acid encoding the same.

In addition, any compound identified using any of the methods described herein can be
formulated and administered to a mammal for treatment of SMA, and/or any disease
or condition associated with a defect in mRNA biosynthesis.

The invention encompasses the preparation and use of pharmaceutical
compositions comprising a compound useful for treatment of SMA identified using a
method of the invention as an active ingredient. Such a pharmaceutical composition
may consist of the active ingredient alone, in a form suitable for administration to a
subject, or the pharmaceutical composition may comprise the active ingredient and one
or more pharmaceutically acceptable carriers, one or more additional ingredients, or
some combination of these. The active ingredient may be present in the
pharmaceutical composition in the form of a physiologically acceptable ester or salt,
such as in combination with a physiologically acceptable cation or anion, as is well
known in the art.

The formulations of the pharmaceutical compositions described herein
may be prepared by any method known or hereafter developed in the art of
pharmacology. In general, such preparatory methods include the step of bringing the
active ingredient into association with a carrier or one or more other accessory
ingredients, and then, if necessary or desirable, shaping or packaging the product into a
desired single- or multi-dose unit.

Although the descriptions of pharmaceutical compositions provided
herein are principally directed to pharmaceutical compositions which are suitable for
ethical administration to humans, it will be understood by the skilled artisan that such
compositions are generally suitable for administration to animals of all sorts.
Modification of pharmaceutical compositions suitable for administration to humans in
order to render the compositions suitable for administration to various animals is well
understood, and the ordinarily skilled veterinary pharmacologist can design and
perform such modification with merely ordinary, if any, experimentation. Subjects to
which administration of the pharmaceutical compositions of the invention is
contemplated include, but are not limited to, humans and other primates, and mammals
including commercially relevant mammals such as cattle, pigs, horses, sheep, cats, and
dogs. In addition, the administration of the compositions to birds is also contemplated.

Pharmaceutical compositions that are useful in the methods of the 
invention may be prepared, packaged, or sold in formulations suitable for oral, rectal, 
vaginal, parenteral, topical, pulmonary, intranasal, buccal, ophthalmic, intrathecal or 
another route of administration. Other contemplated formulations include projected 
nanoparticles, liposomal preparations, resealed erythrocytes containing the active 
ingredient, and immunologically-based formulations.

A pharmaceutical composition of the invention may be prepared, 
packaged, or sold in bulk, as a single unit dose, or as a plurality of single unit doses. 
As used herein, a "unit dose" is a discrete amount of the pharmaceutical composition
comprising a predetermined amount of the active ingredient. The amount of the active 
ingredient is generally equal to the dosage of the active ingredient which would be 
administered to a subject or a convenient fraction of such a dosage such as, for 
example, one-half or one-third of such a dosage. 
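The relationship between a dosage and a unit dose described above can be illustrated with a short sketch. The function name, the milligram units, and the numeric values are hypothetical and chosen only for illustration; they are not dosages taught by the disclosure.

```python
from fractions import Fraction


def unit_dose_amount(dosage_mg, fraction=Fraction(1)):
    """Amount of active ingredient in one unit dose.

    `fraction` is the convenient fraction of the full dosage carried by each
    unit dose (1 for a full dosage, 1/2 or 1/3 for the examples in the text).
    """
    dosage = Fraction(dosage_mg)
    fraction = Fraction(fraction)
    if dosage <= 0:
        raise ValueError("dosage must be positive")
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in the interval (0, 1]")
    return dosage * fraction


# Hypothetical 300 mg dosage split into one-half and one-third unit doses.
half_unit = unit_dose_amount(300, Fraction(1, 2))
third_unit = unit_dose_amount(300, Fraction(1, 3))
```

Exact fractions are used here so that a one-third unit dose multiplied back by three recovers the full dosage without rounding error.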

The relative amounts of the active ingredient, the pharmaceutically 
acceptable carrier, and any additional ingredients in a pharmaceutical composition of 
the invention will vary, depending upon the identity, size, and condition of the subject 
treated and further depending upon the route by which the composition is to be 
administered. By way of example, the composition may comprise between 0.1% and 
100% (w/w) active ingredient. 
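As a worked illustration of the weight-by-weight figure given above, the following sketch computes the percentage (w/w) of active ingredient in a composition and checks it against the exemplary 0.1% to 100% range. The function names and the masses used are hypothetical.

```python
def percent_w_w(active_mass_mg: float, total_mass_mg: float) -> float:
    """Percentage (w/w) of active ingredient in a composition."""
    if total_mass_mg <= 0:
        raise ValueError("total mass must be positive")
    if not 0 <= active_mass_mg <= total_mass_mg:
        raise ValueError("active mass must lie between 0 and the total mass")
    return 100.0 * active_mass_mg / total_mass_mg


def within_exemplary_range(percent: float, low: float = 0.1,
                           high: float = 100.0) -> bool:
    """True if `percent` falls in the exemplary 0.1% to 100% (w/w) range."""
    return low <= percent <= high


# Hypothetical composition: 5 mg active ingredient in a 500 mg tablet.
tablet_percent = percent_w_w(5, 500)
```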

In addition to the active ingredient, a pharmaceutical composition of the
invention may further comprise one or more additional pharmaceutically active agents.
Particularly contemplated additional agents include anti-emetics and scavengers such
as cyanide and cyanate scavengers.

Controlled- or sustained-release formulations of a pharmaceutical
composition of the invention may be made using conventional technology.

A formulation of a pharmaceutical composition of the invention suitable for oral
administration may be prepared, packaged, or sold in the form of a discrete solid dose
unit including, but not limited to, a tablet, a hard or soft capsule, a cachet, a troche, or a
lozenge, each containing a predetermined amount of the active ingredient. Other
formulations suitable for oral administration include, but are not limited to, a powdered
or granular formulation, an aqueous or oily suspension, an aqueous or oily solution, or
an emulsion.

As used herein, an "oily" liquid is one which comprises a carbon-containing
liquid molecule and which exhibits a less polar character than water.

A tablet comprising the active ingredient may, for example, be made by compressing
or molding the active ingredient, optionally with one or more additional ingredients.
Compressed tablets may be prepared by compressing, in a suitable device, the active
ingredient in a free-flowing form such as a powder or granular preparation, optionally
mixed with one or more of a binder, a lubricant, an excipient, a surface active agent,
and a dispersing agent. Molded tablets may be made by molding, in a suitable device,
a mixture of the active ingredient, a pharmaceutically acceptable carrier, and at least
sufficient liquid to moisten the mixture. Pharmaceutically acceptable excipients used
in the manufacture of tablets include, but are not limited to, inert diluents, granulating
and disintegrating agents, binding agents, and lubricating agents. Known dispersing
agents include, but are not limited to, potato starch and sodium starch glycollate.
Known surface active agents include, but are not limited to, sodium lauryl sulphate.
Known diluents include, but are not limited to, calcium carbonate, sodium carbonate,
lactose, microcrystalline cellulose, calcium phosphate, calcium hydrogen phosphate,
and sodium phosphate. Known granulating and disintegrating agents include, but are
not limited to, corn starch and alginic acid. Known binding agents include, but are not
limited to, gelatin, acacia, pre-gelatinized maize starch, polyvinylpyrrolidone, and
hydroxypropyl methylcellulose. Known lubricating agents include, but are not
limited to, magnesium stearate, stearic acid, silica, and talc.

Tablets may be non-coated or they may be coated using known methods 






to achieve delayed disintegration in the gastrointestinal tract of a subject, thereby
providing sustained release and absorption of the active ingredient. By way of
example, a material such as glyceryl monostearate or glyceryl distearate may be used to
coat tablets. Further by way of example, tablets may be coated using methods
described in U.S. Patents numbers 4,256,108; 4,160,452; and 4,265,874 to form
osmotically-controlled release tablets. Tablets may further comprise a sweetening
agent, a flavoring agent, a coloring agent, a preservative, or some combination of these
in order to provide a pharmaceutically elegant and palatable preparation.

Hard capsules comprising the active ingredient may be made using a
physiologically degradable composition, such as gelatin. Such hard capsules comprise
the active ingredient, and may further comprise additional ingredients including, for
example, an inert solid diluent such as calcium carbonate, calcium phosphate, or
kaolin.

Soft gelatin capsules comprising the active ingredient may be made
using a physiologically degradable composition, such as gelatin. Such soft capsules
comprise the active ingredient, which may be mixed with water or an oil medium such
as peanut oil, liquid paraffin, or olive oil.

Liquid formulations of a pharmaceutical composition of the invention
which are suitable for oral administration may be prepared, packaged, and sold either
in liquid form or in the form of a dry product intended for reconstitution with water or
another suitable vehicle prior to use.

Liquid suspensions may be prepared using conventional methods to
achieve suspension of the active ingredient in an aqueous or oily vehicle. Aqueous
vehicles include, for example, water and isotonic saline. Oily vehicles include, for
example, almond oil, oily esters, ethyl alcohol, vegetable oils such as arachis, olive,
sesame, or coconut oil, fractionated vegetable oils, and mineral oils such as liquid
paraffin. Liquid suspensions may further comprise one or more additional ingredients
including, but not limited to, suspending agents, dispersing or wetting agents,
emulsifying agents, demulcents, preservatives, buffers, salts, flavorings, coloring
agents, and sweetening agents. Oily suspensions may further comprise a thickening
agent. Known suspending agents include, but are not limited to, sorbitol syrup,
hydrogenated edible fats, sodium alginate, polyvinylpyrrolidone, gum tragacanth, gum
acacia, and cellulose derivatives such as sodium carboxymethylcellulose, methyl
cellulose, and hydroxypropylmethylcellulose. Known dispersing or wetting agents include,
but are not limited to, naturally-occurring phosphatides such as lecithin, condensation
products of an alkylene oxide with a fatty acid, with a long chain aliphatic alcohol,
with a partial ester derived from a fatty acid and a hexitol, or with a partial ester
derived from a fatty acid and a hexitol anhydride (e.g., polyoxyethylene stearate,
heptadecaethyleneoxycetanol, polyoxyethylene sorbitol monooleate, and
polyoxyethylene sorbitan monooleate, respectively). Known emulsifying agents
include, but are not limited to, lecithin and acacia. Known preservatives include, but
are not limited to, methyl, ethyl, or n-propyl-para-hydroxybenzoates, ascorbic acid, and
sorbic acid. Known sweetening agents include, for example, glycerol, propylene
glycol, sorbitol, sucrose, and saccharin. Known thickening agents for oily suspensions
include, for example, beeswax, hard paraffin, and cetyl alcohol.

Liquid solutions of the active ingredient in aqueous or oily solvents may
be prepared in substantially the same manner as liquid suspensions, the primary
difference being that the active ingredient is dissolved, rather than suspended, in the
solvent. Liquid solutions of the pharmaceutical composition of the invention may
comprise each of the components described with regard to liquid suspensions, it being
understood that suspending agents will not necessarily aid dissolution of the active
ingredient in the solvent. Aqueous solvents include, for example, water and isotonic
saline. Oily solvents include, for example, almond oil, oily esters, ethyl alcohol,
vegetable oils such as arachis, olive, sesame, or coconut oil, fractionated vegetable oils,
and mineral oils such as liquid paraffin.

Powdered and granular formulations of a pharmaceutical preparation of
the invention may be prepared using known methods. Such formulations may be
administered directly to a subject, used, for example, to form tablets, to fill capsules, or
to prepare an aqueous or oily suspension or solution by addition of an aqueous or oily
vehicle thereto. Each of these formulations may further comprise one or more of
a dispersing or wetting agent, a suspending agent, and a preservative. Additional
excipients, such as fillers and sweetening, flavoring, or coloring agents, may also be
included in these formulations.

A pharmaceutical composition of the invention may also be prepared,
packaged, or sold in the form of an oil-in-water emulsion or a water-in-oil emulsion. The
oily phase may be a vegetable oil such as olive or arachis oil, a mineral oil such as
liquid paraffin, or a combination of these. Such compositions may further comprise
one or more emulsifying agents such as naturally occurring gums such as gum acacia
or gum tragacanth, naturally-occurring phosphatides such as soybean or lecithin
phosphatide, esters or partial esters derived from combinations of fatty acids and
hexitol anhydrides such as sorbitan monooleate, and condensation products of such
partial esters with ethylene oxide such as polyoxyethylene sorbitan monooleate. These
emulsions may also contain additional ingredients including, for example, sweetening
or flavoring agents.

A pharmaceutical composition of the invention may be prepared, 
packaged, or sold in a formulation suitable for rectal administration. Such a
composition may be in the form of, for example, a suppository, a retention enema
preparation, or a solution for rectal or colonic irrigation.

Suppository formulations may be made by combining the active 
ingredient with a non-irritating pharmaceutically acceptable excipient which is solid at 
ordinary room temperature (i.e., about 20°C) and which is liquid at the rectal
temperature of the subject (i.e., about 37°C in a healthy human). Suitable
pharmaceutically acceptable excipients include, but are not limited to, cocoa butter,
polyethylene glycols, and various glycerides. Suppository formulations may further 
comprise various additional ingredients including, but not limited to, antioxidants and 
preservatives. 

Retention enema preparations or solutions for rectal or colonic irrigation
may be made by combining the active ingredient with a pharmaceutically acceptable
liquid carrier. As is well known in the art, enema preparations may be administered
using, and may be packaged within, a delivery device adapted to the rectal anatomy of
the subject. Enema preparations may further comprise various additional ingredients
including, but not limited to, antioxidants and preservatives.

A pharmaceutical composition of the invention may be prepared, 
packaged, or sold in a formulation suitable for vaginal administration. Such a
composition may be in the form of, for example, a suppository, an impregnated or
coated vaginally-insertable material such as a tampon, a douche preparation, a gel or
cream, or a solution for vaginal irrigation.

Methods for impregnating or coating a material with a chemical
composition are known in the art, and include, but are not limited to, methods of
depositing or binding a chemical composition onto a surface, methods of incorporating
a chemical composition into the structure of a material during the synthesis of the
material (e.g., with a physiologically degradable material), and methods of
absorbing an aqueous or oily solution or suspension into an absorbent material, with or
without subsequent drying.

Douche preparations or solutions for vaginal irrigation may be made by
combining the active ingredient with a pharmaceutically acceptable liquid carrier. As
is well known in the art, douche preparations may be administered using, and may be
packaged within, a delivery device adapted to the vaginal anatomy of the subject.
Douche preparations may further comprise various additional ingredients including,
but not limited to, antioxidants, antibiotics, antifungal agents, and preservatives. 

As used herein, "parenteral administration" of a pharmaceutical
composition includes any route of administration characterized by physical breaching
of a tissue of a subject and administration of the pharmaceutical composition through
the breach in the tissue. Parenteral administration thus includes, but is not limited to,
administration of a pharmaceutical composition by injection of the composition, by
application of the composition through a surgical incision, by application of the
composition through a tissue-penetrating non-surgical wound, and the like. In
particular, parenteral administration is contemplated to include, but is not limited to,
subcutaneous, intraperitoneal, intramuscular, intrasternal injection, and kidney dialytic
infusion techniques.

Formulations of a pharmaceutical composition suitable for parenteral
administration comprise the active ingredient combined with a pharmaceutically
acceptable carrier, such as sterile water or sterile isotonic saline. Such formulations
may be prepared, packaged, or sold in a form suitable for bolus administration or for
continuous administration. Injectable formulations may be prepared, packaged, or sold
in unit dosage form, such as in ampules or in multi-dose containers containing a
preservative. Formulations for parenteral administration include, but are not limited to,
suspensions, solutions, emulsions in oily or aqueous vehicles, pastes, and implantable
sustained-release or biodegradable formulations. Such formulations may further
comprise one or more additional ingredients including, but not limited to, suspending,
stabilizing, or dispersing agents. In one embodiment of a formulation for parenteral
administration, the active ingredient is provided in dry (i.e., powder or granular) form
for reconstitution with a suitable vehicle (e.g., sterile pyrogen-free water) prior to
parenteral administration of the reconstituted composition.

The pharmaceutical compositions may be prepared, packaged, or sold in
the form of a sterile injectable aqueous or oily suspension or solution. This suspension
or solution may be formulated according to the known art, and may comprise, in
addition to the active ingredient, additional ingredients such as the dispersing agents,
wetting agents, or suspending agents described herein. Such sterile injectable
formulations may be prepared using a non-toxic parenterally-acceptable diluent or
solvent, such as water or 1,3-butane diol, for example. Other acceptable diluents and
solvents include, but are not limited to, Ringer's solution, isotonic sodium chloride
solution, and fixed oils such as synthetic mono- or di-glycerides. Other parenterally-
administrable formulations which are useful include those which comprise the active
ingredient in microcrystalline form, in a liposomal preparation, or as a component of a
biodegradable polymer system. Compositions for sustained release or implantation
may comprise pharmaceutically acceptable polymeric or hydrophobic materials such as
an emulsion, an ion exchange resin, a sparingly soluble polymer, or a sparingly soluble
salt.

Formulations suitable for topical administration include, but are not
limited to, liquid or semi-liquid preparations such as liniments, lotions, oil-in-water or
water-in-oil emulsions such as creams, ointments or pastes, and solutions or
suspensions. Topically-administrable formulations may, for example, comprise from
about 1% to about 10% (w/w) active ingredient, although the concentration of the
active ingredient may be as high as the solubility limit of the active ingredient in the
solvent. Formulations for topical administration may further comprise one or more of
the additional ingredients described herein.

A pharmaceutical composition of the invention may be prepared, 
packaged, or sold in a formulation suitable for pulmonary administration via the buccal 
cavity. Such a formulation may comprise dry particles which comprise the active
ingredient and which have a diameter in the range from about 0.5 to about 7
nanometers, and preferably from about 1 to about 6 nanometers. Such compositions
are conveniently in the form of dry powders for administration using a device
comprising a dry powder reservoir to which a stream of propellant may be directed to
disperse the powder or using a self-propelling solvent/powder-dispensing container
such as a device comprising the active ingredient dissolved or suspended in a low-
boiling propellant in a sealed container. Preferably, such powders comprise particles
wherein at least 98% of the particles by weight have a diameter greater than 0.5
nanometers and at least 95% of the particles by number have a diameter less than 7
nanometers. More preferably, at least 95% of the particles by weight have a diameter
greater than 1 nanometer and at least 90% of the particles by number have a diameter
less than 6 nanometers. Dry powder compositions preferably include a solid fine 
powder diluent such as sugar and are conveniently provided in a unit dose form. 

Low boiling propellants generally include liquid propellants having a
boiling point of below 65 °F at atmospheric pressure. Generally the propellant may
constitute 50 to 99.9% (w/w) of the composition, and the active ingredient may
constitute 0.1 to 20% (w/w) of the composition. The propellant may further comprise
additional ingredients such as a liquid non-ionic or solid anionic surfactant or a solid
diluent (preferably having a particle size of the same order as particles comprising the
active ingredient).

Pharmaceutical compositions of the invention formulated for pulmonary
delivery may also provide the active ingredient in the form of droplets of a solution or
suspension. Such formulations may be prepared, packaged, or sold as aqueous or
dilute alcoholic solutions or suspensions, optionally sterile, comprising the active
ingredient, and may conveniently be administered using any nebulization or
atomization device. Such formulations may further comprise one or more additional
ingredients including, but not limited to, a flavoring agent such as saccharin sodium, a
volatile oil, a buffering agent, a surface active agent, or a preservative such as
methylhydroxybenzoate. The droplets provided by this route of administration
preferably have an average diameter in the range from about 0.1 to about 200
nanometers.

The formulations described herein as being useful for pulmonary 
delivery are also useful for intranasal delivery of a pharmaceutical composition of the 
invention. 

Another formulation suitable for intranasal administration is a coarse powder
comprising the active ingredient and having an average particle size from about 0.2 to 500
micrometers. Such a formulation is administered in the manner in which snuff is 
taken, i.e., by rapid inhalation through the nasal passage from a container of the powder 
held close to the nares. 

Formulations suitable for nasal administration may, for example,
comprise from as little as about 0.1% (w/w) to as much as 100% (w/w) of the active
ingredient, and may further comprise one or more of the additional ingredients
described herein. 

A pharmaceutical composition of the invention may be prepared, 
packaged, or sold in a formulation suitable for buccal administration. Such
formulations may, for example, be in the form of tablets or lozenges made using
conventional methods, and may, for example, comprise 0.1 to 20% (w/w) active ingredient, the
balance comprising an orally dissolvable or degradable composition and, optionally,
one or more of the additional ingredients described herein. Alternately, formulations
suitable for buccal administration may comprise a powder or an aerosolized or
atomized solution or suspension comprising the active ingredient. Such powdered,
aerosolized, or atomized formulations, when dispersed, preferably have an average
particle or droplet size in the range from about 0.1 to about 200 nanometers, and may
further comprise one or more of the additional ingredients described herein. 

A pharmaceutical composition of the invention may be prepared,
packaged, or sold in a formulation suitable for ophthalmic administration. Such
formulations may, for example, be in the form of eye drops including, for example, a
0.1-1.0% (w/w) solution or suspension of the active ingredient in an aqueous or oily
liquid carrier. Such drops may further comprise buffering agents, salts, or one or more
other of the additional ingredients described herein. Other ophthalmically-
administrable formulations which are useful include those which comprise the active
ingredient in microcrystalline form or in a liposomal preparation.

As used herein, "additional ingredients" include, but are not limited to,
one or more of the following: excipients; surface active agents; dispersing agents; inert
diluents; granulating and disintegrating agents; binding agents; lubricating agents;
sweetening agents; flavoring agents; coloring agents; preservatives; physiologically
degradable compositions such as gelatin; aqueous vehicles and solvents; oily vehicles
and solvents; suspending agents; dispersing or wetting agents; emulsifying agents;
demulcents; buffers; salts; thickening agents; fillers; antioxidants;
antibiotics; antifungal agents; stabilizing agents; and pharmaceutically acceptable
polymeric or hydrophobic materials. Other "additional ingredients" which may be
included in the pharmaceutical compositions of the invention are known in the art and
described, for example, in Gennaro, ed. (1985, Remington's Pharmaceutical Sciences,
Mack Publishing Co., Easton, PA), which is incorporated herein by reference.

Typically, dosages of the compound of the invention which may be
administered to an animal, preferably a human, range in amount from 1 microgram to
about 100 grams per kilogram of body weight of the animal. The precise dosage
administered will vary depending upon any number of factors, including but not
limited to, the type of animal and type of disease state being treated, the age of the
animal and the route of administration. Preferably, the dosage of the compound will
vary from about 1 milligram to about 10 grams per kilogram of body weight of the
animal. More preferably, the dosage will vary from about 10 milligrams to about 1
gram per kilogram of body weight of the animal.
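
By way of a non-limiting arithmetic illustration only, the per-kilogram ranges recited above can be converted to a total amount for an animal of a given body weight. The sketch below is written in Python; the function name, the tier labels, and the example body weight are assumptions introduced here for illustration and do not appear in the disclosure.

```python
# Per-kilogram dosage ranges recited in the text, expressed in grams per
# kilogram of body weight. The tier labels are hypothetical names.
DOSE_RANGES_G_PER_KG = {
    "typical": (1e-6, 100.0),       # 1 microgram to about 100 grams
    "preferred": (1e-3, 10.0),      # about 1 milligram to about 10 grams
    "more_preferred": (1e-2, 1.0),  # about 10 milligrams to about 1 gram
}


def total_dose_range(body_weight_kg, tier):
    """Return (low, high) total amounts in grams for the given body weight."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    low, high = DOSE_RANGES_G_PER_KG[tier]
    return (low * body_weight_kg, high * body_weight_kg)


if __name__ == "__main__":
    # Hypothetical 10 kg animal, "more preferred" range.
    print(total_dose_range(10.0, "more_preferred"))
```

The calculation is a simple scaling by body weight; it does not account for the other factors named above, such as the type of animal, the disease state, the age of the animal, or the route of administration.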

The compound may be administered to an animal as frequently as
several times daily, or it may be administered less frequently, such as once a day, once
a week, once every two weeks, once a month, or even less frequently, such as once
every several months or even once a year or less. The frequency of the dose will be
readily apparent to the skilled artisan and will depend upon any number of factors, such
as, but not limited to, the type and severity of the disease being treated, the type and
age of the animal, etc.

Methods 

The invention includes a method of stimulating snRNP assembly. The
method comprises contacting an extract comprising snRNP components (e.g., SMN,
SmB, SmD, snRNA, and the like) with SMN. This method takes advantage of the
novel discovery that SMN mediates snRNP assembly and is a component of the 
complex involved in snRNP assembly. 

The invention includes a method of identifying a compound which
affects the level of SMN expression in a cell. The method comprises contacting a cell
with a test compound and comparing the level of SMN expression in the cell with the
level of SMN expression in an otherwise identical cell which is not contacted with the
compound. A higher or lower level of SMN expression in the cell treated with the
compound relative to the level of SMN expression in an untreated cell indicates that
the compound affects SMN expression.
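
By way of a non-limiting illustration, the comparison recited above can be sketched as a ratio of the SMN expression level measured in the treated cell to the level measured in the otherwise identical untreated cell. The sketch below is written in Python; the function name and the fold-change cutoff of 1.5 are assumptions introduced here for illustration only, since the disclosure does not specify a numerical threshold or a unit of measurement.

```python
def classify_effect(treated_level, untreated_level, min_fold_change=1.5):
    """Classify a test compound from two SMN expression measurements.

    treated_level and untreated_level are expression levels in the same
    arbitrary units (for example, band intensities from a blot).
    min_fold_change is a hypothetical cutoff, not a value from the text.
    """
    if treated_level < 0 or untreated_level <= 0:
        raise ValueError("expression levels must be positive")
    if min_fold_change <= 1:
        raise ValueError("min_fold_change must exceed 1")
    ratio = treated_level / untreated_level
    if ratio >= min_fold_change:
        return "increases"
    if ratio <= 1 / min_fold_change:
        return "decreases"
    return "no clear effect"


if __name__ == "__main__":
    print(classify_effect(3.0, 1.0))
    print(classify_effect(0.5, 1.0))
    print(classify_effect(1.1, 1.0))
```

Either an "increases" or a "decreases" result would correspond to a compound that affects SMN expression in the sense of the method; in practice the cutoff would be chosen with regard to the variability of the particular assay.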

One skilled in the art would appreciate, based upon the disclosure 
provided herein, that such a compound would be useful to treat SMA since lack of, or
decreased, expression of SMN in a cell is causally linked to SMA. Thus, a compound
that affects SMN expression would be an important potential SMA therapeutic. 

The invention also includes a method of identifying compounds which
affect the level of SMN in a cell wherein the cell comprises an SMN modulating
sequence. The method comprises contacting the cell with a test compound and
determining whether contact with the compound affects the expression of SMN in the
cell compared with the level of SMN expression in an otherwise identical cell not
contacted with the compound. Thus, the method allows identification of a compound
that affects SMN expression.

A compound that increases the level of expression of SMN compared to
the level of SMN expression in the untreated cell is potentially useful for the treatment
of SMA, which is characterized by decreased levels of SMN in afflicted individuals.
Thus, the invention also includes a method of identifying a compound useful for the
treatment of SMA. In one aspect, the method comprises contacting a cell comprising
an antisense nucleic acid complementary to a nucleic acid encoding SMN with a test
compound. Then, the level of expression of SMN in the cell contacted with the
compound is compared, using methods such as Northern blotting and the like, to the
level of expression of SMN in an otherwise identical cell not treated with the
compound. A higher level of SMN expression in the cell contacted with the compound
compared with the level in the cell not contacted with the compound indicates that the
compound increased SMN expression in the cell. Since decreased SMN expression is
associated with SMA, compounds that increase SMN expression are important
potential therapeutics to treat the disease. Thus, a method of identifying a compound
that affects the level of SMN expression in a cell is an important tool in the
development of SMA therapeutics since the disease is associated with and apparently
mediated by a defect in the level of SMN expressed and/or the ability of the SMN
produced in the cell to participate in mRNA biogenesis.

One skilled in the art would appreciate, based upon the disclosure
provided herein, that until the present invention, there was no cell model of SMA
available for in vitro testing of compounds or for studying the role of decreased or
absent SMN expression in the SMA disease process. This cell line comprising an
SMN antisense nucleic acid, which exhibits decreased or absent SMN expression, is an
important tool for the study of SMA and for the identification of compounds useful for
treatment of the disease.

Further, the present invention includes a method of identifying a
compound useful for treatment of SMA wherein the cell comprises an SMN
modulating sequence that decreases the expression of endogenous SMN. The SMN
modulating sequence, i.e., a nucleic acid encoding SMN, further comprises a tag
polypeptide that renders the exogenous SMN fusion protein non-functional. Because
expression of the non-functional SMN inhibits expression of functional endogenous
SMN, the cell provides a useful system wherein potential SMA therapeutics can be
identified and evaluated. Thus, the invention includes a method wherein a cell
comprising an SMN modulating sequence which inhibits expression of endogenous
SMN is contacted with a test compound. The level of SMN expression in the cell
contacted with the compound is compared to the level of SMN expression in an
otherwise identical cell not contacted with the compound. A higher level of expression
of SMN in the cell contacted with the compound compared with the level of expression
of the otherwise identical but untreated cell is an indication that the compound is useful
for treatment of SMA.

One skilled in the art would also appreciate, based upon the disclosure
provided herein, that a cell exhibiting decreased or absent expression of SMN would
demonstrate altered growth characteristics due to the deleterious effect(s) of decreased
levels of SMN expression. Therefore, another cell line was produced wherein the
expression of SMN can be controlled thereby allowing the cells to grow and divide so
as to produce sufficient cells to permit testing of potential SMN candidate therapeutics.
That is, once the cells reach sufficient numbers, the level of SMN can be selectively
decreased or completely inhibited.

Therefore, the invention includes a method of identifying a compound
useful for treating SMA comprising contacting a cell that has decreased or absent
levels of SMN expression, thereby mimicking the disease condition of SMA, with a
test compound. The cell
comprises a knock-out targeting vector such that at least one or both copies of the
SMN gene has/have been removed. The cell further comprises a copy of the SMN
gene present within the cell on a vector (e.g., a plasmid) under the control of an
inducible promoter (e.g., the tetracycline repressible promoter, and the like). The
promoter is induced to allow the cells to grow and divide in culture. The promoter is
then turned off such that SMN is not expressed in the cell. The cell then becomes a test
system which emulates the condition in the cells of patients afflicted with SMA. The
cell is then used for the method of the invention in that the cell is contacted with a test
compound. The ability of the compound to treat SMA can then be assessed by
determining the growth characteristics of the cell. That is, a compound that mediates
cell growth is a potential SMA therapeutic since the "block" in mRNA biosynthesis
due to decreased SMN expression has been overcome as a result of the cell having
been contacted with the compound. Further, the ability of the compound to increase
SMN expression can also be assessed using methods of detecting SMN mRNA such as
Northern blotting and RT-PCR assay, and the like.

In one embodiment, a chicken cell (i.e., DT40 which is a pre-B
lymphoid cell) was used. This cell was used because it has a high degree of
homologous recombination making it a useful cell line for use of a knock-out targeting
vector which replaced the nucleic acid sequence encoding SMN through homologous
recombination. However, the cell line need not exhibit a high degree of homologous
recombination; rather, the cell need only exhibit a degree of homologous 
recombination which allows successful use of a knock-out targeting vector which 
requires such recombination to remove all or part of the desired target SMN sequence. 

Likewise, although the DT40 cell comprises only a single SMN allele, the
present invention is not limited to using cells comprising a single SMN allele. Rather,
the invention includes using cells with at least one SMN allele for the production of 
knock-out cell lines. 

The invention also includes a method of identifying an RNA splicing- 
modulating compound. As disclosed herein, splicing of RNA is diminished in the 
absence of SMN. Thus, the method of identifying an RNA splicing-modulating 
compound comprises incubating an in vitro pre-mRNA processing extract in the 
absence of SMN and in the presence or absence of a test compound, and comparing the 
level of splicing of RNA in the extract in the presence of the test compound with the 
level of splicing of the RNA in the absence of the test compound, wherein a higher or a 
lower level of RNA splicing in the extract in the presence of the test compound, 
compared with the level of RNA splicing in the extract in the absence of the test 
compound, is an indication that the test compound is an RNA splicing-modulating 
compound. In one preferred embodiment, the RNA splicing modulation is 
enhancement of RNA splicing. RNA splicing assays are disclosed herein as are 
methods of assessing splicing and include Northern blotting to identify the various 
processing species resulting from the various steps in pre-mRNA processing. 

A compound that increases or decreases RNA-splicing is useful for increasing or
decreasing such reactions in vitro. Such a compound would be useful for treating any
disease or condition associated with or mediated by a decrease (e.g., SMA) or increase 
in RNA-splicing. 

In one aspect, the invention includes a method of enhancing splicing of 
RNA. This method comprises incubating an in vitro pre-mRNA processing extract in 
the presence of SMN or a mutant or variant or a fragment thereof, thereby enhancing
splicing of the RNA. As stated previously herein, a compound that enhances RNA
splicing would be useful for methods requiring such RNA splicing. Further, such a
compound would be a useful candidate therapeutic for treatment of a disease or
condition associated with or mediated by a decrease in RNA splicing.

Further included in the invention is a method of assessing the presence 
or degree of SMA in a mammal. This method is useful to diagnose SMA and is also 
useful for assessing the progress of SMA or the efficacy of treatment in an SMA 
patient. The method comprises obtaining a biopsy comprising motor neurons from the
mammal and assessing the number and morphology of gems in the motor neurons,
wherein a lower number of gems in the motor neurons, compared with the number of
gems in motor neurons obtained from an otherwise identical mammal which does not
have SMA, is an indication that the mammal has SMA, and further wherein the
absence of or the presence of a minimal number of gems in the mammal having SMA
is directly related to the severity of the SMA in the mammal. Such a method would be
useful in the diagnosis of SMA and in providing an assay for following up on the
efficacy of SMA treatments. That is, the number of gems present in the neurons of
SMA patients could be evaluated before, during and after treatment and the efficacy of
the treatment could thus be assessed. Further, such a method allows a determination of
the presence and/or the severity of the disease in a mammal.
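
By way of a non-limiting illustration, the comparison of gem numbers recited above can be sketched as the ratio of the mean number of gems counted per motor neuron in the sample under test to the mean number counted in neurons from an unaffected reference. The sketch below is written in Python; the function name and the example counts are hypothetical, and the disclosure does not specify any numerical cutoff for diagnosis or for grading severity.

```python
from statistics import mean


def gem_ratio(sample_counts, reference_counts):
    """Mean gems per neuron in the sample divided by the reference mean.

    Each argument is a list of gem counts, one count per scored neuron.
    A ratio below 1 means fewer gems in the sample than in the reference.
    """
    if not sample_counts or not reference_counts:
        raise ValueError("both count lists must be non-empty")
    reference_mean = mean(reference_counts)
    if reference_mean <= 0:
        raise ValueError("reference neurons must contain gems")
    return mean(sample_counts) / reference_mean


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    print(gem_ratio([1, 0, 1, 2], [4, 4, 4, 4]))
```

The same ratio could be computed before, during, and after treatment to follow the number of gems over time, in keeping with the use of the method for assessing the efficacy of a treatment.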

Further, the invention includes a method of assessing the level of
binding of SMN obtained from a mammal to determine the presence or degree of
SMA in a mammal. The method takes advantage of the disclosure provided herein that
the SMN of SMA patients exhibits decreased binding with SMN-associated protein
(e.g., SMN, SIP1, and Gemin3). Thus, the method compares the level of binding of
SMN obtained from a mammal (such as by obtaining SMN from cultured cells or a
biopsy obtained from the mammal) to an SMN-associated protein with the level of
binding of SMNwt with an identical SMN-associated protein. Such a binding assay can
be performed in vitro using the methods disclosed herein (e.g., immobilized GST-SIP1
or GST-Gemin3 is contacted with the SMN obtained from the patient). The amount of
SMN specifically bound to the immobilized protein is determined using, for example,
antibodies that bind specifically to SMN, but other methods of assessing the amount of
bound SMN are included in the invention, such as using radiolabeled proteins to detect
the amount bound to the surface.

The invention includes a method of identifying a test compound that
affects binding of SMN with a protein that specifically binds with SMN (i.e., an SMN-
associated protein such as another SMN, SIP1, Gemin3, SmB, SmB', SmD1, SmD2,
and SmD3, and the like). One skilled in the art, based upon the disclosure provided
herein, would appreciate that the protein binding assays disclosed herein can be used to
compare the binding of a labeled SMN-associated protein to immobilized SMN as
described in the various embodiments exemplified herein. The method of
immobilizing the target protein and the identity of the SMN-associated protein or the
label present thereon are not crucial and any number of labels (e.g., tag polypeptide
epitope, radiolabels, and the like), surfaces (e.g., glutathione-Sepharose, nickel
columns), and SMN-associated proteins (e.g., SMN, or a mutant, variant or derivative
thereof, SIP1, and Gemin3) can all be employed in the method.

As discussed previously elsewhere herein, a compound that affects 
binding of SMN with a protein that specifically binds with SMN is an important 
candidate SMA therapeutic since the binding of SMN to its associated proteins in SMA 
patients has been demonstrated, by the data disclosed herein, to be affected. Thus, the
instant method provides an important assay in the development of useful compounds
for the treatment of SMA. 

The invention includes a method of assessing whether a test compound
is useful for treatment of SMA. In essence the method identifies a test compound that
increases binding of SMN with a protein that specifically binds with SMN (i.e., an
SMN-associated protein such as another SMN, SIP1, Gemin3, SmB, SmB', SmD1,
SmD2, and SmD3, and the like). One skilled in the art, based upon the disclosure
provided herein, would appreciate that the protein binding assays disclosed herein can
be used to compare the binding of a labeled SMN-associated protein to immobilized
SMN as described in the various embodiments exemplified herein. Thus, the method
comprises making a preparation comprising the test compound, a labeled protein that
specifically binds with SMN (e.g., another SMN, SIP1, Gemin3, SmB, SmB', SmD1,
SmD2, and SmD3, and the like), and a surface comprising at least a portion of SMN
bound thereon. Preferably, the portion of SMN bound thereon comprises a portion of 
SMN that specifically binds the protein that specifically binds SMN used in the 
method. 

One skilled in the art would appreciate, based upon the disclosure 
provided herein, that the method of immobilizing the target protein and the identity of 
the SMN-associated protein or the label present thereon are not crucial factors in the 
assay such that any number of labels (e.g., tag polypeptide epitope, radiolabels, and the
like), surfaces (e.g., glutathione-Sepharose, nickel columns, immunoaffinity columns),
and SMN-associated proteins (e.g., SMN, or a mutant, variant or derivative thereof,
SIP1, and Gemin3) can all be employed in the method. 

As discussed previously elsewhere herein, a compound that affects 
binding of SMN with a protein that specifically binds with SMN is an important 
candidate SMA therapeutic since the binding of SMN to its associated proteins in SMA 
patients has been demonstrated, by the data disclosed herein, to be affected. Thus, the 
instant method provides an important assay in the development of useful compounds 
for the treatment of SMA. 

The invention also includes a method of enhancing splicing of mRNA. The method comprises
incubating an in vitro pre-mRNA processing extract in the presence of SMN, or any
mutant, derivative, variant, or fragment thereof, thereby enhancing splicing of the
mRNA. Such an extract is disclosed herein and/or it can be any crude nuclear cell extract
capable of pre-mRNA processing. Further, the substrate used in the method can be any
substrate that when processed yields a distinguishable processing product such that the 
activity of the extract can be assessed. Such substrates include the pre-mRNA 
processing substrates disclosed elsewhere herein as well as others well known in the 
art. 

The invention includes a method of identifying a compound that affects 
pre-mRNA splicing. The method comprises incubating an extract capable of pre-
mRNA splicing, such as those disclosed elsewhere herein or other extracts well-known
in the art, in the presence or absence of a test compound and comparing the level of
pre-mRNA splicing in the extract in the absence and presence of the compound, where a higher
or lower level of pre-mRNA splicing in the treated extract is an indication that the test
compound affects pre-mRNA splicing.

The invention further includes a method of identifying a test compound 
that is useful to treat SMA. The method comprises incubating an extract capable of 
pre-mRNA splicing in the presence or absence of a test compound and comparing the 
level of pre-mRNA splicing in the extract in the presence of the test compound with the 
level of splicing of pre-mRNA in the absence of the compound. A higher level of pre- 
mRNA splicing in the extract in the presence of the test compound compared with the 
level of pre-mRNA splicing in the extract not comprising the test compound, is an 
indication that the test compound is useful to treat SMA since decreased pre-mRNA 
splicing is associated with SMA as disclosed, for the first time, herein. Thus, a 
compound that increases pre-mRNA splicing, which is reduced in SMA, is a potential 
SMA therapeutic. 
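
The comparison that underlies the screening methods described above can be sketched in a few lines of Python. This is an illustrative sketch only and is not part of the disclosed assay: the function name `classify_compound`, the use of replicate means, and the `min_fold_change` cutoff are assumptions introduced here. The text itself specifies only that a higher or lower level of splicing relative to the untreated extract is indicative.

```python
from statistics import mean


def classify_compound(treated, untreated, min_fold_change=1.0):
    """Compare splicing levels measured with and without a test compound.

    treated / untreated: replicate measurements (arbitrary units) of spliced
    product from extracts incubated with or without the compound.
    min_fold_change: hypothetical cutoff (>= 1.0); 1.0 means any difference
    between the two means counts as an effect.
    """
    t, u = mean(treated), mean(untreated)
    if u <= 0:
        raise ValueError("untreated level must be positive")
    fold = t / u
    if fold > min_fold_change:
        return "increases splicing (potential SMA therapeutic)"
    if fold < 1.0 / min_fold_change:
        return "decreases splicing"
    return "no detectable effect"
```

The same comparison applies unchanged to the other readouts named in this section (snRNP assembly, SMN binding, SMN expression), since each method compares a treated sample with an otherwise identical untreated one.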

Similarly, a compound that increases snRNP assembly, pre-mRNA
splicing, or SMN binding with an SMN-associated protein (e.g., another SMN, SIP1,
Gemin3, and the like) is useful for treatment of SMA in that each of these activities
is causally linked to SMA.

Thus, the invention includes a method of identifying a compound that 
affects snRNP assembly. The method comprises incubating an extract capable of 
snRNP assembly in the presence or absence of a test compound and comparing the 
level of snRNP assembly in the extract comprising the test compound to the level of 
snRNP assembly in the extract which does not contain the compound. A higher or
lower level of snRNP assembly in the extract containing the compound compared with
the extract not containing the compound is an indication that the test compound
affects snRNP assembly.

The invention further includes a method of identifying a test compound 
that is useful to treat SMA. The method comprises incubating an extract capable of 
snRNP assembly in the presence or absence of a test compound and comparing the 
level of snRNP assembly in the extract containing the test compound with the level of
snRNP assembly in the extract without the compound. A higher level of snRNP
assembly in the extract containing the test compound compared with the level of
snRNP assembly in the extract without it is an indication that the test compound is
useful to treat SMA.

The invention includes a method of assessing the presence or degree of 
SMA in a mammal. The method comprises comparing the level of binding of SMN 
obtained from the mammal to a protein that specifically binds with SMN with the level 
of binding of SMN wild type to an identical protein that specifically binds with SMN, 
wherein a lower level of binding of the SMN from the mammal to the protein that 
specifically binds with SMN compared with the level of binding of SMN wild type 
with the identical protein that specifically binds with SMN is an indication of the 
presence or degree of SMA in a mammal. This is because, as the data disclosed herein
demonstrate, the SMN of SMA patients exhibits decreased binding to itself, and to
other SMN-associated proteins, compared with wild type SMN.

The invention further includes a method of identifying a compound that 
affects SMN expression in a cell. The method comprises contacting a cell as
described herein with a test compound and comparing the level of SMN expression in 
the cell with the level of SMN expression in an otherwise identical cell which is not 
contacted with the test compound. A higher or lower level of SMN expression in the 
cell contacted with the test compound compared with the level of SMN expression in 
the otherwise identical cell which is not contacted with the compound is an indication 
that the test compound affects SMN expression in a cell, thereby identifying a 
compound that affects SMN expression in a cell. 

Also included in the invention is a method of identifying a compound 
that is useful to treat SMA. This method comprises contacting a cell as described 
herein with a test compound and comparing the level of SMN expression in the cell 
with the level of SMN expression in an otherwise identical cell which is not contacted 
with the test compound. A higher level of SMN expression in the cell contacted with 
the test compound compared with the level of SMN expression in the otherwise identical
cell which is not contacted with the test compound is an indication that the test 
compound increases SMN expression in a cell, thereby identifying a compound that is 
useful to treat SMA. 

In addition, there is included in the invention a method of identifying a 
compound useful for the treatment of SMA. This method comprises contacting a cell
as described herein with a test compound and comparing the level of growth of the cell
with the level of growth of an otherwise identical cell which is not contacted with the
test compound. A higher level of growth of the cell contacted with the test compound
compared with the level of growth of the cell which is not contacted with the
compound is an indication that the compound is useful to treat SMA.

Definitions

As used herein, each of the following terms has the meaning associated 
with it in this section. 

The articles "a" and "an" are used herein to refer to one or to more than
one (i.e., to at least one) of the grammatical object of the article. By way of example,
"an element" means one element or more than one element.

As used herein, the term "adjacent" is used to refer to nucleotide
sequences which are directly attached to one another, having no intervening
nucleotides. By way of example, the pentanucleotide 5'-AAAAA-3' is adjacent the
trinucleotide 5'-TTT-3' when the two are connected thus: 5'-AAAAATTT-3' or
5'-TTTAAAAA-3', but not when the two are connected thus: 5'-AAAAACTTT-3'.

As used herein, amino acids are represented by the full name thereof, by the three-letter
code corresponding thereto, or by the one-letter code corresponding thereto, as
indicated in the following table:



Full Name        Three-Letter Code    One-Letter Code
Aspartic Acid    Asp                  D
Glutamic Acid    Glu                  E
Lysine           Lys                  K
Arginine         Arg                  R
Histidine        His                  H
Tyrosine         Tyr                  Y
Cysteine         Cys                  C
Asparagine       Asn                  N
Glutamine        Gln                  Q
Serine           Ser                  S
Threonine        Thr                  T
Glycine          Gly                  G
Alanine          Ala                  A
Valine           Val                  V
Leucine          Leu                  L
Isoleucine       Ile                  I
Methionine       Met                  M
Proline          Pro                  P
Phenylalanine    Phe                  F
Tryptophan       Trp                  W



As used herein, to "alleviate" SMA means reducing the severity of the
symptoms of the disease or disorder. This includes, but is not limited to, increasing the
level of binding of SMN with SIP1, Gemin3, and/or another SMN in a patient afflicted
with SMA compared with the level of binding of SMN to these proteins in the patient
prior to or in the absence of the method of treatment.

By the term "altered growth characteristics," as the term is used herein, 
is meant any variation in growth exhibited by a cell compared with an otherwise 
identical cell into which the SMN modulating sequence has not been introduced. 

"Antisense" refers particularly to the nucleic acid sequence of the non-coding
strand of a double stranded DNA molecule encoding a protein, or to a sequence
which is substantially homologous to the non-coding strand. As defined herein, an
antisense sequence is complementary to the sequence of a double stranded DNA
molecule encoding a protein. It is not necessary that the antisense sequence be
complementary solely to the coding portion of the coding strand of the DNA molecule.
The antisense sequence may be complementary to regulatory sequences specified on
the coding strand of a DNA molecule encoding a protein, which regulatory sequences
control expression of the coding sequences.

By "biological activity," as the term is used herein, is meant that the
protein has the ability to interact with its associated protein(s) and effectuate its normal
function(s) within the cell. In a preferred embodiment, the SMN linked to BSA retains
its biological activity in that the protein retains its ability to bind avidly to SIP1, as
well as the ability to bind to the Sm proteins, to mediate formation of the Sm core
domain, and to participate in its role in mRNA biogenesis. Further, biological activity
as it refers to any form or fragment of SMN, SIP1, and Gemin3 means that the protein
has the ability to bind to SMN, SIP1, and/or Gemin3 and/or to participate in mRNA
biogenesis.

By "complementary to a portion or all of the nucleic acid encoding
SMN" is meant a sequence of nucleic acid which does not encode SMN protein.
Rather, the sequence which is being expressed in the cells is identical to the non-coding
strand of the nucleic acid encoding SMN and thus, does not encode SMN protein.

The terms "complementary" and "antisense" as used herein, are not
entirely synonymous. "Antisense" refers particularly to the nucleic acid sequence of
the non-coding strand of a double stranded DNA molecule encoding a protein, or to a
sequence which is substantially homologous to the non-coding strand.
"Complementary" as used herein refers to the broad concept of subunit sequence
complementarity between two nucleic acids, e.g., two DNA molecules. When a
nucleotide position in both of the molecules is occupied by nucleotides normally
capable of base pairing with each other, then the nucleic acids are considered to be
complementary to each other at this position. Thus, two nucleic acids are
complementary to each other when a substantial number (at least 50%) of
corresponding positions in each of the molecules are occupied by nucleotides which
normally base pair with each other (e.g., A:T and G:C nucleotide pairs). As defined
herein, an antisense sequence is complementary to the sequence of a double stranded
DNA molecule encoding a protein. It is not necessary that the antisense sequence be
complementary solely to the coding portion of the coding strand of the DNA molecule.
The antisense sequence may be complementary to regulatory sequences specified on
the coding strand of a DNA molecule encoding a protein, which regulatory sequences
control expression of the coding sequences.
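
By way of illustration only, the "at least 50%" criterion for complementarity can be sketched as follows. The function names, the requirement that both strands be of equal length, and the ungapped antiparallel alignment are assumptions made for this sketch and are not recited in the definition above.

```python
# Watson-Crick partners for DNA bases (A:T and G:C pairs).
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def fraction_complementary(strand_a, strand_b):
    """Fraction of positions at which two equal-length DNA strands base pair.

    Both strands are written 5' to 3'; strand_b is reversed so that the
    two strands are compared in antiparallel orientation.
    """
    if len(strand_a) != len(strand_b) or not strand_a:
        raise ValueError("strands must be non-empty and of equal length")
    paired = sum(
        PAIRS.get(a) == b
        for a, b in zip(strand_a.upper(), reversed(strand_b.upper()))
    )
    return paired / len(strand_a)


def is_complementary(strand_a, strand_b, threshold=0.5):
    """Apply the 'at least 50% of corresponding positions' criterion."""
    return fraction_complementary(strand_a, strand_b) >= threshold
```

For example, `fraction_complementary("AAGC", "GCTT")` compares 5'-AAGC-3' against the reverse of 5'-GCTT-3'; every position pairs, so the two strands are fully complementary under this sketch.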

A "coding region" of a gene consists of the nucleotide residues of the
coding strand of the gene and the nucleotides of the non-coding strand of the gene
which are homologous with or complementary to, respectively, the coding region of an
mRNA molecule which is produced by transcription of the gene.

A "coding region" of an mRNA molecule also consists of the nucleotide
residues of the mRNA molecule which are matched with an anticodon region of a
transfer RNA molecule during translation of the mRNA molecule or which encode a
stop codon. The coding region may thus include nucleotide residues corresponding to
amino acid residues which are not present in the mature protein encoded by the mRNA
molecule (e.g., amino acid residues in a protein export signal sequence).

"Encoding" refers to the inherent property of specific sequences of 
nucleotides in a polynucleotide, such as a gene, a cDNA, or an mRNA, to serve as
templates for synthesis of other polymers and macromolecules in biological processes
having either a defined sequence of nucleotides (i.e., rRNA, tRNA and mRNA) or a
defined sequence of amino acids and the biological properties resulting therefrom.
Thus, a gene encodes a protein if transcription and translation of mRNA corresponding
to that gene produces the protein in a cell or other biological system. Both the coding
strand, the nucleotide sequence of which is identical to the mRNA sequence and is
usually provided in sequence listings, and the non-coding strand, used as the template
for transcription of a gene or cDNA, can be referred to as encoding the protein or other
product of that gene or cDNA.

Unless otherwise specified, a "nucleotide sequence encoding an amino
acid sequence" includes all nucleotide sequences that are degenerate versions of each
other and that encode the same amino acid sequence. Nucleotide sequences that
encode proteins and RNA may include introns.

"Expression vector" refers to a vector comprising a recombinant 
polynucleotide comprising expression control sequences operatively linked to a
nucleotide sequence to be expressed. An expression vector comprises sufficient
cis-acting elements for expression; other elements for expression can be supplied by the
host cell or in an in vitro expression system. Expression vectors include all those
known in the art, such as cosmids, plasmids (e.g., naked or contained in liposomes) and
viruses (e.g., retroviruses, adenoviruses, and adeno-associated viruses) that incorporate
the recombinant polynucleotide.

A first region of an oligonucleotide "flanks" a second region of the
oligonucleotide if the two regions are adjacent one another or if the two regions are
separated by no more than about 1000 nucleotide residues, and preferably no more than
about 100 nucleotide residues.

As used herein, the term "fragment" as applied to a nucleic acid, may
ordinarily be at least about 20 nucleotides in length, typically, at least about 100
nucleotides, more typically, from about 100 to about 500 nucleotides,
preferably at least about 500 to about 1,000
nucleotides, even more preferably at least about 1,000 nucleotides to about 2,000
nucleotides, yet even more preferably at least about 2,000 to about 3,500, and most
preferably, the nucleic acid fragment will be greater than about 3,500 nucleotides in
length.

As used herein, the term "fragment" as applied to a polypeptide, may
ordinarily be at least about seven contiguous amino acids, typically, at least about
fifteen contiguous amino acids, more typically, at least about thirty contiguous amino
acids, typically at least about forty contiguous amino acids, preferably at least about
fifty amino acids, even more preferably at least about sixty amino acids and most
preferably, the peptide fragment will be greater than about sixty contiguous amino
acids in length.

As applied to a protein, a "fragment" of SIP1 is about 50 amino acids in
length. More preferably, the fragment of SIP1 is about 100 amino acids, even more
preferably, at least about 200, yet more preferably, at least about 300, even more
preferably, at least about 400, yet more preferably, at least about 500, and more
preferably, at least about 600 amino acids in length.

A "genomic DNA" is a DNA strand which has a nucleotide sequence
homologous with a gene. By way of example, both a fragment of a chromosome and a
cDNA derived by reverse transcription of a mammalian mRNA are genomic DNAs.

"Homologous" as used herein, refers to the subunit sequence similarity between two
polymeric molecules, e.g., between two nucleic acid molecules, e.g., two DNA
molecules or two RNA molecules, or between two polypeptide molecules. When a
subunit position in both of the two molecules is occupied by the same monomeric
subunit, e.g., if a position in each of two DNA molecules is occupied by adenine, then
they are homologous at that position. The homology between two sequences is a direct
function of the number of matching or homologous positions, e.g., if half (e.g., five
positions in a polymer ten subunits in length) of the positions in two compound
sequences are homologous then the two sequences are 50% homologous, if 90% of the
positions, e.g., 9 of 10, are matched or homologous, the two sequences share 90%
homology. By way of example, the DNA sequences 3'ATTGCC5' and 3'TATGGC
share 50% homology.
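
The position-by-position calculation described in this definition can be sketched as follows. This is a minimal illustration that assumes two sequences of equal length aligned without gaps; the function name `percent_homology` is hypothetical and is not part of the disclosure.

```python
def percent_homology(seq_a, seq_b):
    """Percentage of positions occupied by the same subunit in both sequences.

    Assumes the two sequences are the same length and already aligned
    position by position (no gaps), as in the worked example in the text.
    """
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)
```

Applied to the example sequences ATTGCC and TATGGC, three of the six positions match (the third, fourth, and sixth), so `percent_homology("ATTGCC", "TATGGC")` gives 50.0, in agreement with the 50% figure stated above.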

As used herein, "homology" is used synonymously with "identity." 
In addition, when the term "homology" is used herein to refer to the nucleic acids and 
proteins, it should be construed to be applied to homology at both the nucleic acid and 
the amino acid levels. 

Percent identity of one polynucleotide or polypeptide with respect to 
another polynucleotide or polypeptide may be determined using any available 
algorithm, such as the BLAST program as described in Altschul et al. (1990, J. Mol. 
Biol. 215:403-410). 

An "isolated nucleic acid" refers to a nucleic acid segment or fragment 
which has been separated from sequences which flank it in a naturally occurring state, 
e.g., a DNA fragment which has been removed from the sequences which are normally
adjacent to the fragment, e.g., the sequences adjacent to the fragment in a genome in 
which it naturally occurs. The term also applies to nucleic acids which have been 
substantially purified from other components which naturally accompany the nucleic 
acid, e.g., RNA or DNA or proteins, which naturally accompany it in the cell. The 
term therefore includes, for example, a recombinant DNA which is incorporated into a 
vector, into an autonomously replicating plasmid or virus, or into the genomic DNA of 
a prokaryote or eukaryote, or which exists as a separate molecule (e.g., as a cDNA or a 
genomic or cDNA fragment produced by PCR or restriction enzyme digestion) 
independent of other sequences. It also includes a recombinant DNA which is part of a 
hybrid gene encoding additional polypeptide sequence. 

In the context of the present invention, the following abbreviations for 
the commonly occurring nucleic acid bases are used. "A" refers to adenosine, "C" 
refers to cytidine, "G" refers to guanosine, "T" refers to thymidine, and "U" refers to 
uridine.

"Mutants," "derivatives," and "variants" of the peptides of the invention 
(or of the DNA encoding the same) are peptides which may be altered in one or more 
amino acids (or in one or more base pairs) such that the peptide (or DNA) is not 
identical to the sequences recited herein, but has the same property as the peptides 
disclosed herein, in that the peptide has the property of binding to SMN, SIP1, Sm 
proteins and/or functioning in mRNA biogenesis. 

By describing two polynucleotides as "operably linked" is meant that a 
single-stranded or double-stranded nucleic acid moiety comprises the two 
polynucleotides arranged within the nucleic acid moiety in such a manner that at least 
one of the two polynucleotides is able to exert a physiological effect by which it is 
characterized upon the other. By way of example, a promoter operably linked to the 
coding region of a gene is able to promote transcription of the coding region. 
Preferably, when the nucleic acid encoding the desired protein further comprises a 
promoter/regulatory sequence, the promoter/regulatory sequence is positioned at the 5' end of the
desired protein coding sequence such that it drives expression of the desired protein in 
a cell. Together, the nucleic acid encoding the desired protein and its 
promoter/regulatory sequence comprise a "transgene." 

As used herein, the term "pharmaceutically acceptable carrier" means a 
chemical composition with which the active ingredient may be combined and which, 
following the combination, can be used to administer the active ingredient to a subject. 

As used herein, the term "physiologically acceptable" ester or salt 
means an ester or salt form of the active ingredient which is compatible with any other
ingredients of the pharmaceutical composition and which is not deleterious to the subject
to which the composition is to be administered.

As used herein, the term "promoter/regulatory sequence" means a 
nucleic acid sequence which is required for expression of a gene product operably 
linked to the promoter/regulatory sequence. In some instances, this sequence may be 
the core promoter sequence and in other instances, this sequence may also include an 
enhancer sequence and other regulatory elements which are required for expression of 
the gene product. The promoter/regulatory sequence may, for example, be one which 
expresses the gene product in a tissue specific manner.

A "constitutive" promoter is a nucleotide sequence which, when 
operably linked with a polynucleotide which encodes or specifies a gene product, 
causes the gene product to be produced in a cell under most or all physiological 
conditions of the cell.

An "inducible" promoter is a nucleotide sequence which, when operably 
linked with a polynucleotide which encodes or specifies a gene product, causes the 
gene product to be produced in a cell substantially only when an inducer which 
corresponds to the promoter is present in the cell.

A "tissue-specific" promoter is a nucleotide sequence which, when
operably linked with a polynucleotide which encodes or specifies a gene product,
causes the gene product to be produced in a cell substantially only if the cell is a cell of
the tissue type corresponding to the promoter.

The term "expression of a nucleic acid" as used herein means the
synthesis of the protein product encoded by the nucleic acid.

The use of the term "DNA encoding" should be construed to include the 
DNA sequence which encodes the desired protein and any necessary 5' or 3' 
untranslated regions accompanying the actual coding sequence. 

By the term "positioned at the 5' end" as used herein, is meant that the
promoter/regulatory sequence is covalently bound to the 5' end of the nucleic acid
whose expression it regulates, at a position sufficiently close to the 5' start site of
transcription of the nucleic acid so as to drive expression thereof.

The direction of 5' to 3' addition of nucleotides to nascent RNA
transcripts is referred to as the transcription direction. The DNA strand having the
same sequence as an mRNA is referred to as the "coding strand"; sequences on the
DNA strand which are located 5' to a reference point on the DNA are referred to as
"upstream sequences"; sequences on the DNA strand which are 3' to a reference point
on the DNA are referred to as "downstream sequences."

A "portion" of a polynucleotide means at least about twenty
sequential nucleotide residues of the polynucleotide. It is understood that a portion of a
polynucleotide may include every nucleotide residue of the polynucleotide.

A "polyadenylation sequence" is a polynucleotide sequence which
directs the addition of a poly A tail onto a transcribed messenger RNA sequence.

A "polynucleotide" means a single strand or parallel and anti-parallel 
strands of a nucleic acid. Thus, a polynucleotide may be either a single-stranded or a 
double-stranded nucleic acid.

The term "nucleic acid" typically refers to large polynucleotides.

The term "oligonucleotide" typically refers to short polynucleotides, generally, no
greater than about 50 nucleotides. It will be understood that when a nucleotide
sequence is represented by a DNA sequence (i.e., A, T, G, C), this also includes an
RNA sequence (i.e., A, U, G, C) in which "U" replaces "T."
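
The correspondence between a DNA sequence and the RNA sequence it also represents amounts to a single substitution, sketched below. The function name `dna_to_rna` and the restriction to the four unambiguous bases are assumptions made for illustration only.

```python
def dna_to_rna(dna):
    """Return the RNA sequence in which "U" replaces "T".

    Only the four unambiguous DNA bases (A, T, G, C) are accepted in this
    sketch; anything else raises ValueError.
    """
    seq = dna.upper()
    if not set(seq) <= set("ATGC"):
        raise ValueError("sequence contains characters other than A, T, G, C")
    return seq.replace("T", "U")
```

The strand orientation is unchanged by this substitution: the left-hand end of the returned sequence corresponds to the left-hand end of the input.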

Conventional notation is used herein to describe polynucleotide sequences: the
left-hand end of a single-stranded polynucleotide sequence is the 5'-end; the left-hand
direction of a double-stranded polynucleotide sequence is referred to as the
5'-direction.

"Primer" refers to a polynucleotide that is capable of specifically
hybridizing to a designated polynucleotide template and providing a point of initiation
for synthesis of a complementary polynucleotide. Such synthesis occurs when the
polynucleotide primer is placed under conditions in which synthesis is induced, i.e., in
the presence of nucleotides, a complementary polynucleotide template, and an agent
for polymerization such as DNA polymerase. A primer is typically single-stranded,
but may be double-stranded. Primers are typically deoxyribonucleic acids, but a wide
variety of synthetic and naturally occurring primers are useful for many applications.
A primer is complementary to the template to which it is designed to hybridize to serve
as a site for the initiation of synthesis, but need not reflect the exact sequence of the
template. In such a case, specific hybridization of the primer to the template depends
on the stringency of the hybridization conditions. Primers can be labeled with, e.g.,
chromogenic, radioactive, or fluorescent moieties and used as detectable moieties.

"Probe" refers to a polynucleotide that is capable of specifically 
hybridizing to a designated sequence of another polynucleotide. A probe specifically
hybridizes to a target complementary polynucleotide, but need not reflect the exact
complementary sequence of the template. In such a case, specific hybridization of the
probe to the target depends on the stringency of the hybridization conditions. Probes
can be labeled with, e.g., chromogenic, radioactive, or fluorescent moieties and used as
detectable moieties.

By the term "promoter/regulatory sequence" is meant a DNA sequence
which is required for expression of a nucleic acid operably linked to the
promoter/regulatory sequence. In some instances, the promoter/regulatory sequence
may function in a tissue specific manner, in that, the promoter/regulatory sequence is
only capable of driving expression in a cell of a particular tissue type. In some
instances, this sequence may be the core promoter sequence and in other instances, this
sequence may also include an enhancer sequence and other regulatory elements which
are required for expression in a tissue-specific manner.

"Recombinant polynucleotide" refers to a polynucleotide having 
sequences that are not naturally joined together. An amplified or assembled
recombinant polynucleotide may be included in a suitable vector, and the vector can be
used to transform a suitable host cell.

A recombinant polynucleotide may serve a non-coding function (e.g.,
promoter, origin of replication, ribosome-binding site, etc.) as well.

A "recombinant polypeptide" is one which is produced upon expression 
of a recombinant polynucleotide. 

"Polypeptide" refers to a polymer composed of amino acid residues,
related naturally occurring structural variants, and synthetic non-naturally occurring
analogs thereof linked via peptide bonds, related naturally occurring structural variants,
and synthetic non-naturally occurring analogs thereof. Synthetic polypeptides can be
synthesized, for example, using an automated polypeptide synthesizer.

The term "protein" typically refers to large polypeptides.

The term "peptide" typically refers to short polypeptides.

Conventional notation is used herein to portray polypeptide sequences:
the left-hand end of a polypeptide sequence is the amino-terminus; the right-hand end
of a polypeptide sequence is the carboxyl-terminus.

As used herein, the term "reporter gene" means a gene, the expression of
which can be detected using a known method. By way of example, the Escherichia
coli lacZ gene may be used as a reporter gene in a medium because expression of the
lacZ gene can be detected using known methods by adding the chromogenic substrate
o-nitrophenyl-β-galactoside to the medium (Gerhardt et al., eds., 1994, Methods for
General and Molecular Bacteriology, American Society for Microbiology, Washington,
DC, p. 574).

A "restriction site" is a portion of a double-stranded nucleic acid which 
is recognized by a restriction endonuclease. 

A portion of a double-stranded nucleic acid is "recognized" by a 
restriction endonuclease if the endonuclease is capable of cleaving both strands of the 
nucleic acid at the portion when the nucleic acid and the endonuclease are contacted.

By the term "specifically binds," as used herein, is meant a compound, 
e.g., a protein, a nucleic acid, an antibody, and the like, which recognizes and binds a 
specific molecule, but does not substantially recognize or bind other molecules in a 
sample. 

A first oligonucleotide anneals with a second oligonucleotide "with high
stringency" if the two oligonucleotides anneal under conditions whereby only
oligonucleotides which are at least about 75%, and preferably at least about 90% or at
least about 95%, complementary anneal with one another. The stringency of
conditions used to anneal two oligonucleotides is a function of, among other factors,
temperature, ionic strength of the annealing medium, the incubation period, the length
of the oligonucleotides, the G-C content of the oligonucleotides, and the expected
degree of non-homology between the two oligonucleotides, if known. Methods of
adjusting the stringency of annealing conditions are known (see, e.g., Sambrook et al.,
1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory,
New York).

By the term "SMN modulating sequence," as used herein, is meant a
nucleic acid sequence which when introduced into a cell affects the level of expression
of SMN protein. Such altered level of SMN expression may, although the invention does
not require it, cause the cell to exhibit altered growth characteristics or altered formation
of snRNP complexes compared with an otherwise identical cell into which the nucleic
acid sequence has not been introduced. The nucleic acid sequence may be a DNA
molecule in either a sense or antisense orientation with respect to the SMN coding
sequence or the nucleic acid sequence may be, inter alia, a ribozyme specific for
human SMN.

As used herein, the term "substantially pure" describes a compound,
e.g., a nucleic acid, protein or polypeptide, which has been separated from components
which naturally accompany it. Typically, a compound is substantially pure when at
least about 10%, preferably at least about 20%, more preferably at least about 50%,
still more preferably at least about 75%, even more preferably at least about 90%, and
most preferably at least about 99% of the total material (by volume, by wet or dry
weight, or by mole percent or mole fraction) in a sample is the compound of interest.
Purity can be measured by any appropriate method, e.g., by column chromatography,
gel electrophoresis or HPLC analysis.

A compound, e.g., a nucleic acid, a protein or polypeptide, is also
"substantially purified" when it is essentially free of naturally associated components
or when it is separated from the native contaminants which accompany it in its natural
state. Thus, a "substantially pure" preparation of a nucleic acid, as used herein, refers
to a nucleic acid sequence which has been purified from the sequences which flank it in
a naturally occurring state, e.g., a DNA fragment which has been removed from the
sequences which are normally adjacent to the fragment in a genome in which it
naturally occurs.

Similarly, a "substantially pure" preparation of a protein or a
polypeptide, as used herein, refers to a protein or polypeptide which has been purified
from components with which it is normally associated in its naturally occurring state.
A substantially pure peptide can be purified by following known procedures for protein
purification, wherein an immunological, enzymatic or other assay is used to monitor
purification at each stage in the procedure. Protein purification methods are well
known in the art, and are described, for example, in Deutscher et al. (1990, In: Guide to
Protein Purification, Harcourt Brace Jovanovich, San Diego).

As used herein, the term "transgene" means an exogenous nucleic acid
sequence which exogenous nucleic acid is encoded by a transgenic cell or mammal.

By the term "exogenous nucleic acid" is meant that the nucleic acid has
been introduced into a cell or an animal using technology which has been developed
for the purpose of facilitating the introduction of a nucleic acid into a cell or an animal.

A cell that comprises a transgene is referred to as a "recombinant cell." 
Such a cell may be a eukaryotic cell or a prokaryotic cell. A gene which is expressed 
in a recombinant cell wherein the gene comprises a recombinant polynucleotide, 
produces a "recombinant polypeptide." 

By the term "SMN-associated protein" as used herein, is meant a 
protein that specifically binds with SMN. Such proteins include another SMN, an 
SIP1, a Gemin3, and various Sm proteins (e.g., SmB, SmB', SmD1, SmD2, and
SmD3).

By "tag" polypeptide is meant any protein which, when linked by a 
peptide bond to a protein of interest, may be used to localize the protein, to purify it 
from a cell extract, to immobilize it for use in binding assays, or to otherwise study its 
biological properties and/or function. A chimeric (i.e., fusion) protein containing a 
"tag" epitope can be immobilized on a resin which binds the tag. Such tag epitopes and 
resins which specifically bind them are well known in the art and include, for example, 
tag epitopes comprising a plurality of sequential histidine residues (His6), which 
allows isolation of a chimeric protein comprising such an epitope on nickel- 
nitrilotriacetic acid-agarose, a hemagglutinin (HA) tag epitope allowing a chimeric 
protein comprising such an epitope to bind with an anti-HA-monoclonal antibody 
affinity matrix, a myc tag epitope allowing a chimeric protein comprising such an 
epitope to bind with an anti-myc-monoclonal antibody affinity matrix, a glutathione-S- 
transferase tag epitope, and a maltose binding protein (MBP) tag epitope, which can 
induce binding between a protein comprising such an epitope and a glutathione- or 
maltose-Sepharose column, respectively. Production of proteins comprising such tag 
epitopes is well known in the art and is described in standard treatises such as 
Sambrook et al., 1989, and Ausubel et al., supra. Likewise, antibodies to the tag
epitope (e.g., anti-HA, anti-myc antibody 9E10, and the like) allow detection and 
localization of the fusion protein in, for example, Western blots, ELISA assays, and 
immunostaining of cells. 

As used herein, to "treat" means reducing the frequency with which
symptoms of SMA are experienced by a patient.

By the term "vector" as used herein, is meant any plasmid or virus
encoding an exogenous nucleic acid. The term should also be construed to include
non-plasmid and non-viral compounds which facilitate transfer of nucleic acid into
virions or cells, such as, for example, polylysine compounds and the like. The vector
may be a viral vector which is suitable as a delivery vehicle for delivery of the SMN,
SIP1, Gemin3, or all of the aforementioned, protein or a nucleic acid encoding SMN,
SIP1, Gemin3 to the patient, or the vector may be a non-viral vector which is suitable
for the same purpose. Examples of viral and non-viral vectors for delivery of DNA to
cells and tissues are well known in the art and are described, for example, in Ma et al.
(1997, Proc. Natl. Acad. Sci. U.S.A. 94:12744-12746). Examples of viral vectors
include, but are not limited to, a recombinant vaccinia virus, a recombinant adenovirus,
a recombinant retrovirus, a recombinant adeno-associated virus, a recombinant avian
pox virus, and the like (Cranage et al., 1986, EMBO J. 5:3057-3063; International
Patent Application No. WO94/17810, published August 18, 1994; International Patent
Application No. WO94/23744, published October 27, 1994). Examples of non-viral
vectors include, but are not limited to, liposomes, polyamine derivatives of DNA, and
the like.

A "knock-out targeting vector," as the term is used herein, means a 
vector comprising two nucleic acid sequences each of which is complementary to a
nucleic acid region flanking a target sequence of interest which is to be deleted and/or
replaced by another nucleic acid sequence. The two nucleic acid sequences therefore
flank the target sequence which is to be removed by the process of homologous
recombination.
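
The outcome of targeting with such a vector can be pictured with a toy string model. The Python sketch below is a simplification under stated assumptions: sequences are treated as single-stranded text, the two flanking sequences are matched by exact identity rather than by base-pairing, and the function name homologous_replace and the example sequences are hypothetical.

```python
def homologous_replace(genome: str, left_arm: str, right_arm: str, cassette: str = "") -> str:
    """Toy model of a knock-out event.

    The sequence lying between the two flanking arms is removed and, if a
    cassette is supplied, replaced by it. The arms themselves are retained.
    """
    left_start = genome.find(left_arm)
    if left_start == -1:
        raise ValueError("left flanking sequence not found")
    target_start = left_start + len(left_arm)
    # Search for the right arm only downstream of the left arm.
    target_end = genome.find(right_arm, target_start)
    if target_end == -1:
        raise ValueError("right flanking sequence not found downstream of the left one")
    return genome[:target_start] + cassette + genome[target_end:]


# Hypothetical example sequences, for illustration only.
locus = "AAACCC" + "GATTACA" + "TTTGGG"
deleted = homologous_replace(locus, "AAACCC", "TTTGGG")
replaced = homologous_replace(locus, "AAACCC", "TTTGGG", cassette="NEO")
```

Passing an empty cassette models deletion of the target sequence; passing a non-empty cassette models its replacement by another nucleic acid sequence.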

The invention is further described in detail by reference to the following
experimental examples. These examples are provided for purposes of illustration only,
and are not intended to be limiting unless otherwise specified. Thus, the invention
should in no way be construed as being limited to the following examples, but rather,
should be construed to encompass any and all variations which become evident as a
result of the teaching provided herein.

Example 1: Association of SMN with its associated protein SIP1 in a complex with
spliceosomal snRNP proteins

The experiments presented in this example may be summarized as
follows.

Previously, molecular characterization of the protein product of the
SMN gene that causes SMA, i.e., SMN, led to the discovery of a novel nuclear
structure, called gem, in which SMN is concentrated (Liu and Dreyfuss, 1996, EMBO
J. 15:3555-3565). Gems appear by size, number, and proximity to be related to coiled
bodies, another subnuclear structure of unknown function. Previous studies suggested
that gems, like coiled bodies, are involved in RNA metabolism, and, consistent with
this, SMN was found to interact with several RNA binding proteins and possibly also
with RNA directly (Liu and Dreyfuss, 1996, supra; Liu et al., 1996, Cold Spring
Harbor Symp. Quant. Biol. 61:689-697). However, the specific function of SMN
remained obscure. The experiments described herein demonstrate that in a yeast
two-hybrid screen using SMN as the bait, a novel protein, SIP1, has been identified. SIP1
forms a stable complex with SMN in vivo and in vitro, and it co-localizes with SMN in
gems and in the cytoplasm.

The data disclosed herein indicate that SMN and SIP1 function as a
complex in vivo. For instance, the interaction of SMN and SIP1 in vitro is resistant to
1 M NaCl, suggesting that the proteins interact avidly. The 300 kDa complex that
contains SMN and SIP1 is stable even in 4 M urea. Also, SMN and SIP1 can be
coimmunoprecipitated with specific monoclonal antibodies. Further, a yeast
two-hybrid screen using SIP1 as the bait under high stringency conditions (15 mM
3-aminotriazole) isolated from a human library only SMN clones. In addition, SMN and
SIP1 co-localize in gems, suggesting that these two proteins function together.

Immunopurification of the 300 kDa complex demonstrated that it
contains, besides SMN and SIP1, spliceosomal snRNP core proteins including B/B', D,
E, F, and G, the snRNP-specific protein U1A, and several other unidentified proteins.
Furthermore, the data disclosed herein demonstrate that SMN interacts directly with
several spliceosomal snRNP core Sm proteins, including B/B', D1-3, and E. These data
suggest that the SMN-SIP1 complex plays an important role in spliceosomal snRNP
biogenesis and/or function. Although BLAST searches of sequence databases with the
SIP1 protein did not identify significant homology to any other proteins, visual
inspection suggested a limited but significant homology between SIP1 and the yeast
protein Brr1 (Noble and Guthrie, 1996, Genetics 143:67-80; 1996, EMBO J.
15:4368-4379). The sequence alignment of human SIP1 and yeast Brr1 is shown in Figure 1.
Brr1 has been shown to be involved in snRNP biogenesis in Saccharomyces cerevisiae;
deletion of Brr1 causes destabilization of newly synthesized spliceosomal U2 snRNA,
and Brr1 interacts genetically with the yeast Sm D1 protein (Noble and Guthrie, 1996,
EMBO J. 15:4368-4379). Searching the yeast genome sequence for possible SMN
homologs has not identified any proteins with significant similarity to SMN.

SMN, because it can interact with SIP1 and with the spliceosomal
snRNP Sm proteins via different domains, can potentially serve as the key bridging
component to bring together the various components of the complex. It is therefore
particularly interesting to note that many SMA patients have deletions or point
mutations encompassing exons 6 and 7 of SMN, the region shown herein to be
involved in binding of SMN to the Sm proteins. Further, several missense mutations in
the region of SMN corresponding to the P2 peptide have recently been described
(Hahnen et al., 1996, Hum. Mol. Genet. 4:1927-1933), and it will be of interest to
determine if such mutations exhibit altered binding to the Sm proteins.

The Materials and Methods used in the experiments presented in this
example are now described.

Yeast Two-Hybrid Screening 

The human HeLa cDNA library, yeast strains, and yeast plasmids
pGBT9, pGADGH, pVA3, and pTD1 were obtained from Clontech Laboratories, Inc.
(Palo Alto, CA). The manipulation of yeast and the library screening were carried out
according to the conditions suggested by the manufacturer. In brief, the coding region
of SMN was cloned into the pGBT9 vector. The S. cerevisiae HF7c reporter strain was
first transformed with the pGBT9-derived SMN construct and, subsequently, with the
HeLa cDNA library.

Approximately 6 x 10^6 transformants were seeded on eight 150 mm
plates containing synthetic medium lacking histidine, leucine, and tryptophan. His+
colonies were grown on synthetic medium plates lacking leucine and tryptophan and
were then assayed for β-galactosidase activity by filter assay as described by the
manufacturer. Of 6 million transformants screened, 146 were His+ LacZ+ colonies.
These positive colonies fell into ten groups. One of these groups had ten independent
cDNA clones that all encoded identical sequences of SIP1. The library plasmid was
recovered from these clones into the Escherichia coli HB101 strain. True positive
clones were confirmed by their ability to transactivate HIS3 and LacZ reporters when
cotransforming HF7c with pGBT9 containing SMN.
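
The scale of this screen can be put in perspective with simple arithmetic on the numbers given above. The Python sketch below is illustrative only; the function name screen_summary and the dictionary keys are hypothetical, and an even distribution of transformants across the eight plates is an assumption.

```python
def screen_summary(transformants: float, plates: int, positives: int) -> dict:
    """Summarize a library screen from the counts reported in the text."""
    if transformants <= 0 or plates <= 0 or positives < 0:
        raise ValueError("counts must be positive (positives may be zero)")
    return {
        # Assumes the transformants were spread evenly over the plates.
        "transformants_per_plate": transformants / plates,
        "positive_fraction": positives / transformants,
        "transformants_per_positive": (transformants / positives) if positives else None,
    }


# Values taken from the paragraph above: 6 x 10^6 transformants, eight plates,
# and 146 His+ LacZ+ colonies.
summary = screen_summary(6e6, 8, 146)
```

Under these assumptions each plate carried about 750,000 transformants, and roughly one transformant in 41,000 scored as His+ LacZ+.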

Production of Monoclonal Antibodies to SIP1 

Anti-SIP1 antibodies 2S7 and 2K9 were prepared by immunizing
Balb/C mice with His6-tag SIP1 chimeric protein purified from nickel chelation
chromatography using a Novagen (Madison, WI) His-Bind buffer kit. Hybridoma
production and screening and ascites fluid production were performed as previously
described (Choi and Dreyfuss, 1984, J. Cell. Biol. 99:1997-2004).

Production of Proteins In Vitro

The [35S]-labeled proteins were produced by an in vitro
transcription-translation reaction (Promega Biotech, Madison, WI) in the presence of
[35S]-methionine (Amersham, Arlington Heights, IL). His6-SMN fusion protein was
expressed from a pET bacterial expression system in the E. coli strain
BL21(DE3)pLysS and purified by nickel chelation chromatography using the same
kit as described above. GST-SMN fusion protein was expressed from a GST
expression vector pGEX-5X-3 (Pharmacia Biotech, Piscataway, NJ) in the E. coli
strain BL21 and purified using glutathione-Sepharose provided by Pharmacia Biotech
according to the manufacturer's protocol.

In Vitro Protein-Binding Assays 

Purified GST or GST fusion protein (2 micrograms) was incubated with
10^6 cpm of the in vitro translated protein product and 25 µl of glutathione-Sepharose
beads in 500 µl of binding buffer (50 mM Tris-HCl [pH 7.5], 2 mM EDTA, 0.1%
NP40, 2 micrograms per milliliter leupeptin and pepstatin A, and 0.5% aprotinin)
containing different salt (NaCl) concentrations. Following incubation for 30 minutes at
4°C, the resin was pelleted and washed with binding buffer, and the bound fraction was
eluted by boiling in SDS-PAGE sample buffer. The proteins were analyzed by
SDS-PAGE, and they were visualized by fluorography. For the binding experiments
described in Figure 5, 200 to 300 ng of GST or GST-SMN fusion protein was bound to
30 µl of glutathione-Sepharose beads dissolved in phosphate buffered saline (pH 7.4)
and incubated with [35S]-labeled in vitro translated protein (approximately 1 x 10^5 to
2 x 10^5 cpm/assay) for 30 minutes at 4°C. The beads were subsequently washed six
times with phosphate buffered saline/350 mM NaCl (pH 7.4), and the bound protein
fraction was eluted by boiling in SDS-PAGE sample buffer.
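
The final concentrations recited for the binding buffer can be converted into pipetting volumes with the usual dilution relation C1 x V1 = C2 x V2. The Python sketch below is illustrative only: the stock concentrations (1 M Tris-HCl, 0.5 M EDTA, 10% NP40) are assumptions that do not appear in the text, the names stock_volume and RECIPE are hypothetical, and the protease inhibitors, salt, beads, and protein are left out for brevity.

```python
def stock_volume(final_conc: float, final_volume: float, stock_conc: float) -> float:
    """Volume of stock needed, from C1 * V1 = C2 * V2.

    final_conc and stock_conc must share units; the result has the units of
    final_volume.
    """
    if stock_conc <= 0 or final_volume <= 0 or final_conc < 0:
        raise ValueError("concentrations and volume must be positive")
    if final_conc > stock_conc:
        raise ValueError("stock is more dilute than the requested final concentration")
    return final_conc * final_volume / stock_conc


FINAL_VOLUME_UL = 500.0  # binding reaction volume stated in the text

# component: (final concentration from the text, ASSUMED stock concentration)
RECIPE = {
    "Tris-HCl pH 7.5 (mM)": (50.0, 1000.0),
    "EDTA (mM)": (2.0, 500.0),
    "NP40 (%)": (0.1, 10.0),
}

volumes_ul = {
    name: stock_volume(final, FINAL_VOLUME_UL, stock)
    for name, (final, stock) in RECIPE.items()
}
# Remaining volume available for water and the components omitted here.
remaining_ul = FINAL_VOLUME_UL - sum(volumes_ul.values())
```

With different stock solutions only the second number of each RECIPE entry needs to change.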

Cell Culture and Treatments

HeLa cells and NIH 3T3 cells were cultured in Dulbecco's modified
Eagle's medium (DMEM; GIBCO BRL, Gaithersburg, MD) supplemented with 10%
fetal calf serum (FCS, GIBCO BRL). Low temperature incubations were carried out as
follows. HeLa cells were shifted to 32°C and incubated for 24 hours prior to fixation
and permeabilization of the cells for immunostaining. For actinomycin D treatment,
HeLa cells were incubated with 5 micrograms per milliliter of actinomycin D for 3
hours before fixation for immunostaining. For in vivo labeling with [35S]methionine,
50% confluent HeLa cells growing in 100 mm plates were incubated with 10
microCuries per milliliter of [35S]methionine in DMEM without methionine and
supplemented with 10% FCS overnight before homogenization for
immunoprecipitation.

Immunoprecipitation and Immunoblotting

Immunoprecipitation of in vitro translated SIP1 protein was carried out
in the presence of 1% Empigen BB buffer as previously described by Choi and
Dreyfuss (1984, J. Cell. Biol. 99:1997-2004). Immunoprecipitation and purification of
the SMN complex were carried out using total HeLa cell lysate in the presence of 0.5%
Triton X-100 as previously described in Pinol-Roma et al. (1988, Genes Dev.
2:215-227). For immunoblotting, proteins were resolved on a 12.5% SDS-polyacrylamide
gel and transferred to nitrocellulose membrane (Schleicher and Schuell, Inc., Keene,
NH) using a BioTrans Model B Transblot apparatus (Gelman Sciences, Ann Arbor,
MI) according to the instructions of the manufacturer. Filters were incubated in
blotting solution (phosphate buffered saline, 5% nonfat milk) for at least 1 hour at
room temperature, rinsed with cold phosphate buffered saline, and then incubated with
primary antibody for at least 1 hour at room temperature. The filters were washed
three times in phosphate buffered saline containing 0.1% Tween 20, and bound
antibodies were detected using the peroxidase-conjugated goat anti-mouse IgG plus
IgM (Jackson ImmunoResearch Laboratories, Bar Harbor, ME). The protein bands
were visualized using an ECL Western blotting detection kit (Amersham, Arlington
Heights, IL) after washing the filters three times in phosphate buffered saline
containing 0.1% Tween 20.

Immunofluorescence Microscopy

Immunofluorescence microscopy was carried out essentially as
previously described (Choi and Dreyfuss, 1984, J. Cell. Biol. 99:1997-2004) with the
following modifications. Primary monoclonal antibodies 2B1 and 1816 were diluted
1:1000 in phosphate buffered saline containing 3% bovine serum albumin (BSA). The
incubation with the first and second antibody was at room temperature for 1 hour. In
double-label immunofluorescence experiments, a mixture of primary or secondary
antibodies was incubated at the same time. Laser confocal fluorescence microscopy
was performed with a Leica TCS 4D (Germany) confocal microscope. The antibodies
used for these experiments were as follows: against p80-coilin, monoclonal
antibody Pd and rabbit polyserum R288; against fibrillarin, human
autoimmune antibody 1881. Rabbit polyclonal antibody against SMN was raised
against exon 7 of SMN protein by Quality Controlled Biochemicals, Hopkinton, MA,
and the polyclonal antibodies were affinity purified.

HeLa Cell Fractionation and Chromatography

HeLa cells were fractionated according to the methods described by
Dignam et al. (1983, Nuc. Acids Res. 11:1475-1489). Fractionation of the nuclear or
cytoplasmic S100 fraction was carried out as follows. The nuclear fraction (200 µl of
approximately 20 milligrams per milliliter protein) in buffer D (20 mM Tris-HCl [pH
7.4], 0.1 mM EDTA, 1 mM DTT, 20% glycerol, 500 mM KCl) or S100 fraction (400
microliters of approximately 20 milligrams per milliliter protein) in buffer F (20 mM
Tris-HCl [pH 7.4], 0.1 mM EDTA, 1 mM DTT, 10% glycerol, 500 mM KCl) was
loaded onto a TSK-GEL G3000-SW glass column (TosoHaas, Montgomeryville, PA).
The column was then washed with buffer A (20 mM Tris-HCl [pH 7.4], 200 mM
NaCl, 2.5% glycerol) with or without 4 M urea at 0.25 ml/min flow rate. Fractions (0.5
ml) were collected, and 15 µl of each fraction was analyzed by SDS-PAGE.
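
Two routine calculations follow from these chromatography conditions: the collection time per fraction, and the estimation of apparent molecular mass from elution volume. The Python sketch below is a minimal illustration and not the calibration actually used; it assumes the common log-linear relation between molecular mass and elution volume, and the calibration standards shown are invented round numbers chosen only to make the example easy to follow.

```python
import math


def minutes_per_fraction(fraction_ml: float, flow_ml_per_min: float) -> float:
    """Collection time for one fraction at a constant flow rate."""
    if fraction_ml <= 0 or flow_ml_per_min <= 0:
        raise ValueError("fraction volume and flow rate must be positive")
    return fraction_ml / flow_ml_per_min


def fit_calibration(standards):
    """Least-squares fit of log10(mass) = a + b * elution_volume.

    `standards` is a list of (elution_volume_ml, mass_da) pairs.
    """
    xs = [volume for volume, _ in standards]
    ys = [math.log10(mass) for _, mass in standards]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if n < 2 or sxx == 0:
        raise ValueError("need at least two standards with distinct elution volumes")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b


def estimate_mass(elution_ml: float, a: float, b: float) -> float:
    """Apparent mass in daltons predicted by the fitted calibration line."""
    return 10 ** (a + b * elution_ml)


# Conditions from the text: 0.5 ml fractions at 0.25 ml/min.
fraction_minutes = minutes_per_fraction(0.5, 0.25)

# HYPOTHETICAL standards (elution volume in ml, mass in Da), not from the text.
a, b = fit_calibration([(8.0, 1e6), (10.0, 1e5), (12.0, 1e4)])
```

An apparent mass obtained this way reflects hydrodynamic size as well as mass, so it is best read as an estimate.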

Far Western Analyses Using [35S]SMN

In vitro purified snRNP proteins were analyzed by SDS-PAGE, and the
proteins were then transferred onto a nitrocellulose membrane as described previously
herein. The nitrocellulose membrane was incubated in blotting solution (phosphate
buffered saline, 5% nonfat milk) for at least 1 hour at room temperature, rinsed with
cold phosphate buffered saline, and then incubated with in vitro translated
[35S]methionine-labeled SMN (2 x 10^6 cpm) for 2 hours at room temperature. The
nitrocellulose membrane was washed three times in phosphate buffered saline
containing 0.05% NP40, and bound SMN was detected by exposing the membrane to
X-ray film.

Preparation of BSA-Peptide Conjugates

BSA-peptide conjugates were prepared as described in Fischer et al.
(1995, Cell 82:475-483). In brief, peptides starting with an N-terminal cysteine and
consisting of amino acids 13-44 (CRRGAGQSDDSDIWDDTALIKAYDKAVS) and
240-267 (CEDDEALGSMLISWYMSGYHTGYYLGLKQ) of human SMN or the
HIV-1 Rev NES (CLPPLERLTL) (Fischer et al., 1995, supra) were cross-linked to
BSA using sulfo-SMCC (Pierce Chemical Co., Rockford, IL) as a cross-linker. The
cross-linked BSA conjugates were separated from unreacted peptides and cross-linking
reagent by dialysis against phosphate buffered saline (pH 7.4), and the conjugates were
concentrated using a Centricon30 concentrator (Amicon, Bedford, MA).

The Results of the experiments presented in this example are now
described.

SIP1, a Novel SMN-Interacting Protein

Using SMN as a bait in a yeast two-hybrid screen of a HeLa cDNA
library, ten independent partial cDNA clones with insert sizes ranging from about 1 kb
to about 1.3 kb were isolated, all of which contained the same open reading frame. The
longest of these clones, designated 7-10, contained an insert of approximately 1.3 kb
that was completely sequenced. Using the BLAST search program to search the
GenBank database, an EST (clone #Z64761) (Cross et al., 1994, Nature Genet.
6:236-244) that is identical to the 5' end of clone 7-10 and which extends further upstream
was identified. Conceptual translation of this cDNA revealed another potential
methionine 24 amino acids upstream of the first methionine of clone 7-10.
Immediately upstream of this methionine was a stop codon. It is not clear which
methionine is the actual initiation methionine for the full-length SIP1 cDNA. The
3'-untranslated region is very AU-rich and contains a putative polyadenylation site
AAUAAA. Thus, this is likely the full-length cDNA clone for the novel protein of the
invention that interacts with SMN and which has been termed SIP1. The predicted amino
acid sequence of SIP1, along with the sequence of the Xenopus laevis homolog that is
also disclosed herein, is presented in Figure 1. The nucleic acid sequence of SIP1
disclosed herein encodes a protein of approximately 279 amino acids (including the
potential 24 amino acids predicted by the EST sequence) with a calculated molecular
mass of 32 kDa and pI of 5.3.
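
A calculated molecular mass of this kind is obtained by summing residue masses over the predicted sequence and adding one water. The Python sketch below is illustrative and is not the program used to derive the values above: the table holds approximate average residue masses, the 110 Da mean residue mass is a rule-of-thumb assumption, the function names are hypothetical, and the pI calculation is not attempted. Because the SIP1 sequence itself appears only in Figure 1, the short Rev NES peptide recited in the Materials and Methods is used as input.

```python
# Approximate average residue masses in daltons (free amino acid minus water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.0153


def average_mass(sequence: str) -> float:
    """Average mass in daltons of an unmodified linear peptide."""
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    unknown = sorted(set(seq) - set(RESIDUE_MASS))
    if unknown:
        raise ValueError("unknown residue(s): " + ", ".join(unknown))
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER


def rough_mass_kda(n_residues: int, mean_residue_da: float = 110.0) -> float:
    """Back-of-envelope mass from length alone (ASSUMES ~110 Da per residue)."""
    return n_residues * mean_residue_da / 1000.0


rev_nes_mass_da = average_mass("CLPPLERLTL")  # peptide sequence from the text
sip1_rough_kda = rough_mass_kda(279)          # length stated in the text
```

The length-based estimate for a protein of approximately 279 amino acids comes out near 31 kDa, which is in the same range as the 32 kDa calculated mass stated above.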

To examine the interaction of SIP1 with SMN and to characterize SIP1
further, monoclonal antibodies to the SIP1 protein were generated by immunizing mice
with purified recombinant 6His-tag SIP1 (starting with the second methionine)
chimeric polypeptide. Two of these monoclonals, 2E17 and 2S7, were further
characterized in detail and were shown to react with SIP1 specifically by both
immunoprecipitation and by Western blotting. Monoclonal antibody 2E17 reacted
with a protein of similar size in Xenopus; thus, using this as the primary antibody, a
Xenopus oocyte cDNA library was screened and the Xenopus homolog of SIP1 was
isolated. The predicted amino acid sequence of Xenopus SIP1 is presented in Figure 1.
Interestingly, all of the eight clones obtained by screening the Xenopus library with
2E17 monoclonal antibody lack the first 24 amino acids that are potentially found in
the human EST clone but are also missing from the clones obtained by screening the
human library with this antibody. Xenopus SIP1 is highly similar to human SIP1, the
two proteins being approximately 90% homologous in amino acid sequence (see
Figure 1). BLAST searches did not reveal significant homology to any other protein in
the databases. However, a yeast protein, termed Brr1, appears to have significant
similarity to SIP1 by visual sequence comparison (Figure 1).
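
A figure such as "approximately 90%" is typically derived from a pairwise alignment by counting matching positions. The Python sketch below shows one simple way to do this, assuming the two sequences have already been aligned and padded with gap characters; it counts strict identity over gap-free columns, which is only one of several conventions and may differ from how the value above was obtained. The function name and the example strings are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of gap-free alignment columns holding identical residues."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a.upper(), aligned_b.upper()):
        if res_a == gap or res_b == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if res_a == res_b:
            matches += 1
    if compared == 0:
        raise ValueError("no gap-free columns to compare")
    return 100.0 * matches / compared


# Invented toy alignment, for illustration only.
toy_identity = percent_identity("MKT-LV", "MKSALV")
```

In the toy alignment five columns are gap-free and four of them match, giving 80% identity.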

SIP1 Interacts with SMN In Vitro and In Vivo

The yeast two-hybrid results were confirmed by assessing the
interaction of SIP1 with SMN both in vitro and in vivo in HeLa cells. For the in vitro
binding assay, SMN was expressed as a chimeric fusion protein with a glutathione
S-transferase (GST) tag polypeptide. SIP1 was produced and labeled with
[35S]methionine by in vitro transcription and translation in rabbit reticulocyte lysate.
GST or GST-SMN proteins immobilized on glutathione-Sepharose were incubated
with [35S]-labeled SIP1 protein. Following washing at various salt concentrations (200
mM to 1 M), proteins that were still bound to either GST or GST-SMN immobilized on
glutathione-Sepharose were dissociated by boiling in SDS-containing sample buffer,
and the eluted material was analyzed by SDS-PAGE. As shown in Figure 2A,
full-length SIP1 bound specifically to immobilized GST-SMN but not to immobilized GST
alone. This binding appeared to be very avid because it was not disrupted at 1 M NaCl.

Immunoprecipitation experiments were performed to examine if SMN
and SIP1 interact in vivo. Anti-SMN monoclonal antibody 2B1 (Liu and Dreyfuss,
1996, EMBO J. 15:3555-3565) was used to immunoprecipitate SMN from total HeLa
cell extract. The immunoprecipitates were then resolved by SDS-PAGE and
immunoblotted with monoclonal antibody 2S7 specific for SIP1. As shown in Figure
2B (lane designated "2B1 IP"), 2S7 readily detected SIP1 in the 2B1
immunoprecipitates, indicating that SIP1 was coimmunoprecipitated with SMN. In a
reciprocal experiment, the SMN protein was also coimmunoprecipitated by the
anti-SIP1 monoclonal antibody 2S7 (Figure 2B, lane designated "2S7 IP"). These results
were confirmed using other monoclonal antibodies to SMN and to SIP1. As shown in
Figure 2, there was no crossreactivity between the anti-SMN and anti-SIP1 monoclonal
antibodies. These results indicate that SMN and SIP1 are associated in vivo and that
they can be coimmunoprecipitated by either anti-SMN or anti-SIP1 monoclonal
antibodies.

Gel filtration experiments further confirmed the existence in vivo of a
complex containing both SIP1 and SMN. HeLa nuclear and cytoplasmic S100 extracts
were fractionated on a high performance gel filtration column, TSK-GEL G3000-SW,
and each fraction was subjected to SDS-PAGE. SMN and SIP1 were detected in the
column fractions by immunoblotting with specific monoclonal antibodies. Figure 2C
depicts the results of the cytoplasmic fractionation experiments. SMN and SIP1
comigrate as a peak of approximately 300 kDa, suggesting that they are part of a large
macromolecular complex. The observed size suggests that this complex contains
multiple copies of the SMN and SIP1 proteins and/or additional components. This
complex is very stable, as it resists dissociation by 4 M urea, and it is observed in both
the nuclear and cytoplasmic fractions.
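
The inference that the complex holds more than one copy of each protein, or additional components, rests on a simple mass balance. The Python sketch below makes that arithmetic explicit. It is illustrative only: the SIP1 mass is the calculated value given earlier in this example, whereas the SMN mass of about 38 kDa is an assumption that is not stated in this section, and the function name is hypothetical.

```python
COMPLEX_KDA = 300.0  # approximate peak size from the gel filtration experiments
SIP1_KDA = 32.0      # calculated molecular mass stated earlier in this example
SMN_KDA = 38.0       # ASSUMPTION: approximate mass of SMN, not given in this section


def mass_accounted(copies: dict, masses_kda: dict) -> float:
    """Total mass in kDa contributed by the given number of copies of each protein."""
    return sum(count * masses_kda[name] for name, count in copies.items())


masses_kda = {"SMN": SMN_KDA, "SIP1": SIP1_KDA}

# One copy of each protein.
one_pair_kda = mass_accounted({"SMN": 1, "SIP1": 1}, masses_kda)
fraction_explained = one_pair_kda / COMPLEX_KDA
unexplained_kda = COMPLEX_KDA - one_pair_kda
```

Under these assumptions a single SMN-SIP1 pair accounts for roughly 70 kDa, less than a quarter of the observed peak. Since gel filtration reports apparent size, which also depends on shape, the comparison is indicative rather than exact.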

SIP1 and SMN Co-localize in Gems in the Nucleus and in the Cytoplasm

Indirect immunofluorescence microscopy using the anti-SIP1
monoclonal antibodies 2E17 and 2S7 was performed on HeLa cells to determine the
cellular localization of SIP1. Figures 3A and 3B illustrate that SIP1 is found
throughout the cytoplasm and also displays intense staining of prominent discrete
bodies in the nucleus as assessed using standard light microscopy immunofluorescence.
This pattern is very similar to that seen for the SMN protein (Liu and Dreyfuss, 1996,
EMBO J. 15:3555-3565), except that the nucleoplasmic staining of SIP1 is somewhat
stronger than that seen for SMN.

In order to determine if the intensely staining nuclear structures are
gems or coiled bodies, double-label laser confocal immunofluorescence experiments
were performed using antibodies against p80-coilin (Figure 3C), fibrillarin (Figure
3D), snRNP proteins (Y12, Figure 3E) found in coiled bodies, and SMN (Figure 3F)
found in gems. Figures 3C, 3D, and 3E depict the results of the double labeling
experiments using the anti-coiled body antibodies and the anti-SIP1 antibody 2S7. The nuclear
structures that contain SIP1 were clearly different from coiled bodies, but the two
bodies were, in most cases, closely associated. However, the staining with the
anti-SIP1 monoclonal antibody 2S7 and a rabbit serum raised against exon 7 of SMN
demonstrates that SMN and SIP1 completely colocalized in gems (Figure 3F). The
weak signal in the cytoplasm makes it impossible to determine whether SMN and SIP1
also completely colocalized in the cytoplasm. However, co-localization of SMN and
SIP1 is very likely because of the tight association of SMN with SIP1 described herein.
The co-localization of SMN with SIP1 further supports the conclusion that these two
proteins exist as a complex in the cell. SIP1 is thus the second constituent of gems
described thus far.

The SMN-SIP1 Complex Contains Spliceosomal snRNP Proteins

The observation that SMN and SIP1 are associated in a large
(approximately 300 kDa) complex prompted experiments to identify any possible
additional components in this complex. To this end, immunoprecipitations using
anti-SMN and anti-SIP1 monoclonal antibodies on [35S]-labeled HeLa cell lysates
were performed, and the immunoprecipitated proteins were then analyzed by
SDS-PAGE. As shown in Figure 4A, similar patterns were obtained using anti-SMN
and anti-SIP1 monoclonal antibodies to immunoprecipitate complex-associated
proteins. Several proteins were specifically coimmunopurified by anti-SMN and
anti-SIP1 antibodies. Besides SMN and SIP1 proteins, there was a prominent doublet
at approximately 97 kDa, a group of proteins of approximately 28 kDa, and a group of
proteins of approximately 15 kDa. This protein complex was quite stable, since it was
resistant to SDS/Triton X-100/deoxycholate-containing buffer and to a high salt wash
(500 mM NaCl). As a
reference for these immunoprecipitations (and for reasons discussed herein), a lane was
included in the gel demonstrating an immunoprecipitation with the monoclonal
antibody Y12 (Figure 4A, lane designated "Y12"), which antibody is directed against
the Sm proteins common to spliceosomal snRNPs (Lerner and Steitz, 1979, Proc. Natl.
Acad. Sci. USA 71:5495-5499; Lerner et al., 1981, Proc. Natl. Acad. Sci. USA
78:2737-2741). To characterize this complex further, the protein samples were
subjected to high resolution, two-dimensional nonequilibrium pH gradient gel
electrophoresis (NEPHGE). Figure 4B depicts the major proteins that were
specifically found in the anti-SMN (2B1) isolated complex but not in control (SP2/0)
immunoprecipitations. The major proteins selectively immunoprecipitated by
anti-SMN (2B1) are labeled as SIP1, SIP2, SIP3, and SIP4. Further, the group of basic, low
molecular weight proteins in the anti-SMN immunoprecipitate exhibited the same
pattern as some of the Sm proteins in immunopurified snRNPs. For direct comparison,
anti-Sm (Y12) immunoprecipitate from HeLa nuclear fractions was analyzed in
parallel, and U1A, Sm B/B', D1-3, E, F, and G proteins of snRNPs migrated in exactly
the same pattern as those proteins in the SMN-SIP1 complex. Immunoblotting
experiments using monoclonal antibodies against the U1 snRNP-specific protein U1A
and the anti-Sm monoclonal antibody Y12 confirmed that these proteins were indeed
the spliceosomal snRNP proteins. Immunoprecipitations using Y12 starting with either
total HeLa extract or that derived from nucleoplasm also demonstrated that SMN and
SIP1 could be detected in Y12 immunoprecipitates by immunoblotting. The
immunoprecipitations shown in Figure 4 were carried out using nucleoplasm as the
starting material. Similar results, although with considerably higher background, were
obtained using cytoplasmic or whole cell extracts, and predigestion with RNases did
not reduce the signal. This suggests the SMN-SIP1-Sm protein complexes can be
found both in the nucleus and in the cytoplasm. It has not been definitively determined
whether the immunoprecipitated SMN-SIP1-Sm complexes contain snRNAs since,
e.g., the RNase resistance of the complexes may simply be due to the fact that the
RNAs were not accessible to RNase. However, experiments described herein
demonstrate that SMN and SIP1 immunoprecipitated labeled snRNAs, suggesting the
complex may contain such ribonucleic acids.

The SMN Protein Interacts with Sm B/B', Sm D, and Sm E Proteins Directly

Most of the snRNAs in snRNP complexes are resistant to RNase
digestion and this, therefore, makes it difficult to determine if the SMN-SIP1-snRNP
protein complexes result from protein-protein or protein-RNA interactions. To
investigate whether SMN interacts with snRNP proteins directly, in vitro translated
[35S]methionine-labeled SMN was used in a far-Western blot assay to probe
SDS-PAGE-resolved proteins of purified snRNPs. The protein composition of purified
snRNPs is shown in Figure 5 (lane designated "snRNPs"). The result of probing these
proteins with [35S]SMN (Figure 5, lane designated "[35S]SMN") indicates that SMN
specifically and directly bound to Sm B/B' and also to one or several of the Sm D
proteins. When the same experiments were performed with in vitro translated SIP1, no
specific binding of SIP1 to snRNP proteins was detected although, in the same assay,
SIP1 bound strongly to recombinant SMN protein. These findings suggest that the
association of SMN with snRNPs occurs via direct protein-protein interaction between
SMN and Sm B/B' and one or more of the D group proteins, although the possibility
that some component in the rabbit reticulocyte lysate mediates this interaction cannot
be excluded.

The binding in solution of SMN to other Sm proteins was examined
using in vitro translated and [35S]-labeled Sm proteins B, D1, D2, D3, E, F, and G,
which were tested for their ability to bind to recombinant GST-SMN fusion protein
immobilized on glutathione-Sepharose beads pursuant to the methods described by
Lehmeier et al. (1994, Proc. Natl. Acad. Sci. USA 91:12317-12321), Herrmann et al.
(1995, EMBO J. 14:2076-2088) and Raker et al. (1996, EMBO J. 15:2256-2269). As
shown in Figure 5B, all Sm proteins, except for F and G, bound efficiently to
immobilized GST-SMN, whereas there was no detectable binding to immobilized GST
alone. Moreover, similar experiments failed to detect any interaction between the Sm
proteins and SIP1.

SMN Contains Two Distinct Binding Sites for the Sm Proteins and for SIP1

The data disclosed previously herein demonstrate that SMN interacts
with both SIP1 and with several of the Sm proteins; therefore, it was determined
whether binding of SMN to Sm proteins and to SIP1 was mutually exclusive or
whether SMN could bind both Sm proteins and SIP1 at the same time, possibly via two
different binding sites on a single SMN protein. Truncated peptides were synthesized
corresponding to the two most conserved regions of SMN (determined by comparing
the sequence of the human SMN with that of Xenopus SMN) located at the N terminus
(at amino acids 13-44) and at the C terminus (at amino acids 240-267), under the
assumption that these highly conserved domains may be involved in important
protein-protein interactions. These regions are also conserved in several candidate SMN
orthologs identified in divergent organisms (Talbot et al., 1997, Hum. Mol. Genet.
3:497-500). These truncated peptides were then coupled to BSA (termed BSA-P1 and
BSA-P2, respectively) and used as competitors in the binding assays for SMN to SIP1
and for SMN to the Sm proteins. Without competitors, SIP1 and Sm B bound to
GST-SMN (Figure 5C) as noted previously herein. However, BSA-P1 completely abolished
binding of SIP1 to SMN, while the binding of Sm B was unaffected (Figure 5C). In
contrast, BSA-P2 strongly inhibited the binding of SMN to Sm B but had no effect on
the binding of SMN to SIP1. BSA coupled to HIV-1 Rev NES protein (BSA-Ctrl)
(Fischer et al., 1995) was used as a non-specific control, and this chimeric protein had
no effect on the binding of SMN to either SIP1 or to Sm B protein (Figure 5C).
Additional experiments demonstrated that the corresponding domains of SMN are
alone sufficient for binding to SIP1 and Sm B, and similar results were obtained for the
other Sm proteins. Thus, the data disclosed herein define two independent binding
sites for SIP1 and the Sm proteins on a single SMN protein molecule. SMN may
therefore serve as a critical bridge between the Sm proteins and SIP1, and SMN may
nucleate the formation of the SMN-SIP1-Sm complex.

Example 2: The SMN-SIP1 complex has an essential role in spliceosomal snRNP
biogenesis

The experiments presented in this example may be summarized as follows.

The data disclosed herein, obtained from microinjection studies in
Xenopus laevis oocytes, provide important insight into the function of the SMA
disease gene product, SMN, and its associated protein SIP1. SMN and SIP1 are tightly
associated as two subunits of a heteromeric protein complex, and both are found in the
oocyte cytoplasm. Further, as previously disclosed herein, SMN and SIP1 are
associated in a complex of approximately 300 kDa which also contains the Sm
proteins and several additional spliceosomal snRNP-specific proteins. The present
experiments demonstrate that the SMN-SIP1 complex has an essential role in
spliceosomal snRNP biogenesis. Several lines of evidence lead to this conclusion.
First, SMN and SIP1 are specifically associated in the cytoplasm with U1 and U5
snRNAs but not with nuclear snRNPs and not with other RNAs tested so far. Thus,
they are not components of mature nuclear snRNPs but rather are associated with them
only during the cytoplasmic phase of their biogenesis. Second, anti-SIP1 antibodies
strongly interfere with the assembly of the Sm core domain of spliceosomal U snRNAs
and with their nuclear import. Third, anti-SMN antibodies, surprisingly, have the
opposite effect and stimulate the assembly of Sm proteins onto the Sm site of
spliceosomal U snRNAs. In addition, as previously set forth herein, two distinct
domains in SMN, P1 and P2, have been identified that mediate its interaction with
SIP1 and with several Sm proteins, respectively, in addition to its capacity to interact
with itself (Liu and Dreyfuss, 1996, EMBO J. 15:3555-3565).

The Materials and Methods used in the experiments presented in this 
example are now described. 

Western Blot Analysis 

Oocytes were defolliculated and dissected into nuclear and cytoplasmic
fractions (see methods set forth herein). The nuclei were precipitated in ethanol. The
cytoplasmic fractions were homogenized in 5:1 buffer (Fischer et al., 1993,
EMBO J. 12:573-583) and centrifuged for 15 minutes at 4°C. The supernatant was
transferred to a new test tube and precipitated with four volumes of acetone. After
centrifugation, the pellets were washed with 70% ethanol, dried, and
resuspended in SDS-PAGE sample buffer. For Western blot analysis, proteins
were separated on an SDS-polyacrylamide gel (12.5%) and subsequently transferred to
a nitrocellulose membrane (Schleicher and Schuell, Inc., Keene, NH) using a
Bio-Trans Model B Transblot apparatus (Gelman Science) according to the manufacturer's
instructions.

After protein transfer, the blotting membrane was incubated in blotting
solution (phosphate buffered saline containing 5% nonfat
milk) for 1 hour at room temperature, rinsed with phosphate buffered saline, and then
incubated in blotting solution with the primary antibody for 1 hour at room
temperature. The membrane was then washed three times with phosphate buffered
saline containing 0.1% Tween 20, and bound antibodies were detected using
peroxidase-conjugated goat anti-mouse IgG plus IgM (Jackson ImmunoResearch
Laboratories, Bar Harbor, ME). The proteins were visualized using an ECL Western
blotting detection kit (Amersham, Arlington Heights, IL) after washing the membrane
three times in phosphate buffered saline containing 0.1% Tween 20.

Oocyte Injections 

Oocyte injections were carried out as described in Fischer et al. (1993, 
supra). In brief, oocytes were incubated for 3 hours in modified Barth's solution 
containing 0.2% collagenase type II (Sigma Chemical Co., St. Louis, MO). 
Defolliculated stage V and VI oocytes were collected and usually used on the same day
for microinjection.

In a typical injection experiment, 30 nl of [32P]-labeled RNA (1 x 10^6
cpm/µl; total concentration of 0.7 µM) was injected either into the nucleus or into the
cytoplasm. For the antibody inhibition experiments, oocytes were preinjected with
antibody (1 microgram/µl, or 3 micrograms/µl in Figure 9B) and the oocytes were
incubated for 1 hour before they received a second injection of [32P]-labeled RNA.
Nucleo-cytoplasmic transport of injected RNAs was monitored by dissection of the
oocytes into nuclear and cytoplasmic fractions. Both fractions were incubated for 20
minutes in homogenization buffer, and the RNAs were isolated and analyzed by
electrophoresis on denaturing RNA gels as described in Hamm et al. (1990, Cell
62:569-577).
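
As a rough check on the quantities stated above, the amount of label and of RNA
delivered per oocyte follows directly from the injection volume and the stated
concentrations. The short sketch below is illustrative arithmetic only; the function
name and the unit handling are assumptions introduced here and are not part of the
described method.

```python
# Illustrative arithmetic only; the function name is hypothetical.
def injected_amounts(volume_nl, cpm_per_ul, conc_um):
    """Return (cpm, femtomoles) delivered by a single injection."""
    volume_ul = volume_nl / 1000.0            # nanoliters -> microliters
    cpm = volume_ul * cpm_per_ul              # radioactivity delivered
    # 1 µM equals 1 pmol/µl, so pmol = µl * µM; multiply by 1000 for fmol
    fmol = volume_ul * conc_um * 1000.0
    return cpm, fmol


# Values taken from the text: 30 nl at 1 x 10^6 cpm/µl and 0.7 µM
cpm, fmol = injected_amounts(30, 1e6, 0.7)
print(f"{cpm:.0f} cpm and {fmol:.0f} fmol RNA per oocyte")
```

Under these figures each oocyte would receive on the order of 30,000 cpm and about
21 fmol of labeled RNA.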

Immunoprecipitation of RNA-Protein Complexes 
Immunoprecipitation of RNA-protein complexes was performed
as described (Fischer et al., 1993, supra). Briefly, the injected oocytes were
homogenized in 300 µl of ice-cold phosphate buffered saline (pH 7.4). The insoluble
fraction was pelleted by centrifugation, and the clear supernatant was transferred into a
new 1.5 ml Eppendorf tube containing antibodies bound to protein G-Sepharose beads
(Pharmacia). This mixture was incubated with constant shaking for 1 hour at 4°C and
the beads were subsequently washed five times with 1 ml aliquots of ice-cold
phosphate buffered saline. Bound RNAs were isolated by phenol extraction for 1 hour,
and the RNAs were then precipitated with ethanol and analyzed by denaturing gel
electrophoresis as described previously herein.

Plasmid DNA In Vitro Transcription and Translation 

Plasmids coding for U1, U2, U4, U5, and U6 snRNAs have been
described previously in Hamm et al. (1990, Cell 62:569-577) and Fischer et al. (1993,
EMBO J. 12:573-583; 1995, Cell 82:475-483). Plasmids encoding dihydrofolate
reductase mRNA (DHFR) and tRNAl were described in Jarmolowski and Mattaj
(1993, EMBO J. 12:223-232), Jarmolowski et al. (1994, J. Cell Biol. 124:627-635),
and in Fischer et al. (1995, Cell 82:475-483). A plasmid coding for the human U3
snoRNA. Clones encoding the Sm proteins B, D1, D2, D3, E, F, and G are described
in Raker et al. (1996, EMBO J. 15:2256-2269). The clone encoding SIP1 has been
described previously herein. In vitro transcription of [32P]-labeled RNAs was carried
out exactly as described in Fischer et al. (1993, EMBO J. 12:573-583). Labeled RNA
was precipitated in ethanol and resuspended in water. For nuclear injection, RNA was
dissolved in water containing 10 milligrams per milliliter dextran blue (MW
1,000,000). In vitro translation of [35S]-labeled proteins was carried out using a
combined transcription and translation kit (TnT) (Promega Biotech, Madison, WI)
according to the instructions of the manufacturer.

The Results of the experiments presented in this example are now described.

As previously disclosed herein, a novel protein complex containing
SMN, SIP1, and spliceosomal snRNP proteins, including several of the Sm proteins,
has been discovered in the cytoplasm of human cells. The data presented herein were
obtained in the Xenopus oocyte system, which is an advantageous, well-characterized
system in which to study spliceosomal snRNP biogenesis by use of microinjections
(Mattaj and DeRobertis, 1985, Cell 40:111-118; Mattaj, 1986, Cell 46:905-911).

SMN and SIP1 Are Associated with Spliceosomal snRNAs in the Cytoplasm

As previously disclosed herein, a novel protein complex containing
SMN, SIP1, and spliceosomal snRNP proteins, including several of the Sm proteins, is
present in the cytoplasm of human cells. Because the Xenopus oocyte provides a
particularly advantageous and well-characterized system in which to study
spliceosomal snRNP biogenesis by use of microinjections (Mattaj and DeRobertis,
1985; Mattaj, 1986), it was first determined whether SMN and SIP1 are present in
these oocytes. If so, the unique features of this system could then be used to
investigate the possible functions of these proteins in snRNP biogenesis.

Immunoblotting with monoclonal antibodies to the human SMN and
SIP1 proteins (2B1 and 2E17, respectively) on Xenopus tissue culture cells detected
proteins of similar size to the corresponding human proteins. cDNA cloning,
sequencing, and transfection experiments confirmed that these proteins are the Xenopus
homologs of SMN and SIP1. Surprisingly, however, unlike the situation in somatic
cells where there is more SMN and SIP1 in the nucleus, both proteins were found
almost exclusively in the oocyte cytoplasm (Figure 6A). The high cytoplasmic
concentration of SMN and SIP1 in the Xenopus oocyte is reminiscent of the large
amounts of Sm proteins that are stored in the oocyte cytoplasm (Zeller et al., 1983, Cell
32:425-434).

In order to identify potential cellular targets for the SMN-SIP1 complex,
the association of the complex with RNA was investigated. Various [32P]-labeled
RNAs were generated by transcription in vitro, including mRNA, tRNA, U3 snoRNA,
5S RNA, and the spliceosomal snRNAs U1, U2, U4, U5, and U6. Different mixtures
of these RNAs were then coinjected into the cytoplasm of oocytes, and
immunoprecipitations were carried out with anti-SMN (2B1) and anti-SIP1 (2E17)
monoclonal antibodies. As shown in Figure 6B, only U1 and U5 snRNAs were
efficiently immunoprecipitated, indicating that they interact with SMN and SIP1. A
weak but reproducible immunoprecipitation of U4 snRNA was also observed. In
contrast, other RNAs, including mRNA, tRNA, U3 snoRNA, U6 snRNA, U2 snRNA
(Figure 6B), and 5S rRNA were not immunoprecipitated at significant levels with
either the anti-SMN or the anti-SIP1 antibodies.

Since neither SMN nor SIP1 is detectable in the oocyte nucleus (Figure
6A), it seemed possible that they are not associated with mature U1 and U5 snRNPs
but rather associate with them only during the cytoplasmic phase of their biogenesis
pathway. To ascertain
this, U1 and U5 snRNAs were injected into the cytoplasm of oocytes, and the oocytes
were incubated for 12 hours (Figure 7). After this incubation period, approximately
50% of the injected snRNA was transported to the nucleus while the rest was still in
the cytoplasm. Immunoprecipitations from the nuclear and cytoplasmic fractions were
then carried out using either anti-SMN antibody, anti-SIP1 antibody, or the anti-Sm
monoclonal antibody Y12, and the coimmunoprecipitated RNAs were analyzed by
denaturing gel electrophoresis followed by fluorography. As previously discussed
herein, anti-Sm monoclonal antibody Y12 recognizes a subset of the Sm proteins and
was used in this experiment to monitor the assembly of the Sm core domain (Lerner et
al., 1981, Proc. Natl. Acad. Sci. USA 78:2737-2741). As previously reported (Mattaj,
1986, Cell 46:905-911; Fischer and Luhrmann, 1990, Science 249:786-790), U1 and
U5 snRNAs were immunoprecipitated by Y12 in approximately equal amounts from
the nucleus and cytoplasm (Figure 7). This indicated that the Sm proteins associate in
the cytoplasm with the snRNA and then move as an assembled and stable snRNP
complex to the nucleus. In striking contrast, however, SMN and SIP1 association with
U1 and U5 snRNAs was observed only in the cytoplasm (Figure 7).

Further evidence for the physiological relevance of the interaction of
SMN and SIP1 in the cytoplasm with spliceosomal snRNPs was obtained following
nuclear injections of U1 snRNA. Only after export to the cytoplasm and during the
cytoplasmic phase of its biogenesis could U1 snRNA be immunoprecipitated by
anti-SMN or anti-SIP1 antibodies. Thus, the SMN-SIP1 complex interacts with U1 and U5
snRNAs in the cytoplasm but not after these snRNAs have been assembled into
snRNPs and have been imported into the nucleus. Therefore, SMN and SIP1 dissociate
from the spliceosomal snRNPs either prior to nuclear entry or shortly thereafter.

Anti-SIP1 Antibodies Inhibit Spliceosomal snRNP Assembly and
Nuclear Import

The association of SMN and SIP1 with U1 and U5 snRNPs in the
cytoplasm suggested a role for these proteins in the biogenesis pathway of these
snRNPs, i.e., in the assembly of snRNP proteins onto these snRNAs and/or in the
nuclear import of these particles. Therefore, whether anti-SMN or anti-SIP1 antibodies
have an effect on the nucleo-cytoplasmic transport of spliceosomal snRNPs was
examined (Figure 8). Anti-SMN (2B1), anti-SIP1 (2E17), or anti-Sm (Y12) antibodies
were injected into the cytoplasm of oocytes. One hour later, a mixture of U1, U5, and,
as a control for nuclear injection, U6 snRNA was injected into the nuclei of the same
oocytes, and the incubation was continued for 15 hours. The oocytes were then
fractionated, and the RNAs in the nucleus and cytoplasm were analyzed. In control
oocytes preinjected with water, nuclear-injected U1 and U5 snRNAs were exported to
the cytoplasm and, after cytoplasmic assembly of the Sm core domain, reimported to
the nucleus (Figure 8A) (Hamm et al., 1990, Cell 62:569-577; Neuman de Vegvar and
Dahlberg, 1990, Mol. Cell. Biol. 10:3365-3375; Terns et al., 1993, Genes Dev.
7:1898-1906). This was confirmed by immunoprecipitation of U1 and U5 snRNAs with
anti-Sm antibodies and by the observation that the nuclear pool of U1 snRNA had
undergone 3' end trimming (see Figure 8A, nuclear fractions). The latter has been
previously shown to occur in the cytoplasm prior to nuclear import (Neuman de
Vegvar and Dahlberg, 1990, supra; Terns et al., 1993, supra). In oocytes preinjected
with anti-Sm antibody Y12, U1 and U5 were also exported to the cytoplasm; however,
they were not reimported to the nucleus, resulting in the accumulation of these RNAs in
the cytoplasm (Figure 8A). This is because upon binding to the Sm proteins, Y12
interferes with the subsequent steps in the biogenesis of snRNPs that are required for
their nuclear import as discussed below. Surprisingly, a similar result was obtained in
oocytes preinjected with anti-SIP1 antibodies (Figure 8A). However, no effect on
nuclear import of U1 and U5 snRNA was observed in oocytes injected with anti-SMN
antibodies (Figure 8A).

Next, it was determined whether anti-SIP1 antibodies interfere with the
nuclear import of the other spliceosomal snRNPs. In vitro transcribed snRNAs U1,
U2, U4, and U5 were injected into the cytoplasm of oocytes, either without or with
anti-SIP1 antibody, and nuclear import was then assessed 15 hours later (Figure 8B).
In the absence of anti-SIP1 antibody injection, all snRNAs accumulated in the nucleus
to approximately 50%, although the import of U4 was less efficient (Figure 8B).
However, in the presence of anti-SIP1 antibody, the nuclear import of U1, U2, and U5
was almost completely inhibited, and the import of U4 was delayed by at least 50%
(Figure 8B). Thus, anti-SIP1 antibodies interfere with the nuclear import of all
spliceosomal U snRNPs tested regardless of whether they can be efficiently
immunoprecipitated with the anti-SMN or anti-SIP1 antibodies. This suggests that the
interaction of SMN and SIP1 with some snRNAs is transient and cannot be monitored
by immunoprecipitation.

As discussed previously herein, assembly of the Sm core domain and
the formation of the m3G cap are required for the nuclear import of U snRNPs (Mattaj
and DeRobertis, 1985, Cell 40:111-118; Fischer and Luhrmann, 1990, Science
249:786-790; Hamm et al., 1990, Cell 62:569-577; Fischer et al., 1993, EMBO J.
12:573-583). Therefore, anti-SIP1 antibodies may inhibit U snRNP import by
interfering either with the assembly of the Sm core or with the cap hypermethylation or
both. Alternatively, and not mutually exclusively, the antibody could directly interfere
with the snRNP transport process, e.g., by blocking transport factors.

To further examine the inhibitory effect of anti-SIP1 antibodies on U
snRNP nuclear import, the effect of anti-SIP1 antibodies on the assembly of the Sm
core domain was studied (Figure 9A). Anti-SIP1, anti-SMN, or anti-Sm (Y12) antibodies
or, as negative controls, nonimmune antibodies SP2/0 or water, were injected into the
cytoplasm of oocytes, followed by incubation for 1 hour before the oocytes received a
second cytoplasmic injection of a mixture of [32P]-labeled U1, U5, and U6 snRNAs. One
hour later, the Sm core assembly on the injected RNAs was analyzed by
immunoprecipitation with Y12 (Figure 9A). In control oocytes preinjected with either
water or SP2/0, both U1 and U5 snRNAs were efficiently immunoprecipitated with
Y12, indicating that these RNAs assembled with the Sm proteins. As expected, U6
snRNA was not immunoprecipitated because this RNA does not contain an Sm site and
thus cannot bind Sm proteins. However, injection with anti-SIP1 antibody completely
inhibited the Sm core assembly on U1 and U5 snRNAs (Figure 9A). Y12 itself
strongly reduced the Sm core assembly on both U1 and U5 snRNAs to a similar extent
(Figure 9A). The anti-SMN antibody 2B1, in contrast, did not inhibit but rather
slightly stimulated assembly of Sm proteins (Figure 9A). If the snRNAs are injected 1
hour prior to injection of the anti-SIP1 antibody, no interference with subsequent Y12
immunoprecipitation is observed. The observed inhibition of Sm core assembly by
pre-injection of anti-SIP1 is therefore not due to a nonspecific occlusion of the Sm
epitope by the anti-SIP1 antibody (Figure 9A).

The effects of injection of anti-SIP1 antibodies on the assembly of U2
and U4 snRNAs were also determined. As shown in Figure 9A, pre-injection of
anti-SIP1 antibodies strongly interfered with the Sm core assembly on both U2 and U4
snRNAs, while in oocytes preinjected with water, both snRNAs assembled with the Sm
proteins. The inhibition of nuclear import of snRNAs U1, U2, U4, and U5 by
anti-SIP1 antibodies is, therefore, at least partially due to the interference of the anti-SIP1
antibody with the Sm core domain assembly. Thus, without wishing to be bound by
theory, SIP1 is a cytoplasmic assembly factor that mediates the formation of the Sm
core domain on spliceosomal U snRNPs.

Anti-SMN Monoclonal Antibody Stimulates Sm Protein Binding onto
the Sm Site of U snRNAs

Although SMN is in a tight complex with SIP1 and, therefore, is likely
to form a functional unit with SIP1, the anti-SMN antibodies used in the experiments
described above, unlike the anti-SIP1 antibodies, did not interfere with U snRNP
biogenesis; rather, anti-SMN antibodies stimulated core domain assembly. Therefore,
the function of SMN in snRNP assembly was examined further. As shown in Figure
9B, injection of anti-SMN antibodies (2B1) did not inhibit snRNP assembly. However,
upon injection of high concentrations (2 to 3 micrograms/µl) of this anti-SMN
antibody, the assembly of snRNPs was often enhanced. In order to analyze this effect
in more detail, a mixture of U5 and U6 snRNAs was injected along with either high (3
micrograms/µl) or low (1 microgram/µl) concentrations of the anti-SMN antibody
2B1 or the anti-SIP1 antibody 2E17 (Figure 9B). Sm protein binding onto these RNAs
was then assessed by immunoprecipitation with the anti-Sm antibody Y12 one hour
later. After 1 hour, the assembly of Sm proteins onto U snRNAs was not complete,
thus allowing a more quantitative evaluation of the efficiency of Sm protein binding.
In the absence of coinjected antibody, U5 but not U6 was precipitated by anti-Sm,
indicating Sm core formation on U5 snRNA but not on U6 at that time point.
However, while low concentrations of 2B1 had only a slight stimulatory effect on Sm
protein binding, coinjection of higher 2B1 concentrations significantly enhanced the
assembly approximately 2- to 3-fold (Figure 9B). Coinjected anti-SIP1 antibody, in
contrast, inhibited the assembly of Sm proteins onto U5 snRNA almost entirely at both
low and high antibody concentrations (Figure 9B). Thus, 2B1 stimulates Sm protein
binding onto U5 snRNA and hence Sm core formation, indicating that SMN is also
involved in spliceosomal U snRNP assembly.

The data disclosed herein further demonstrate that 2B1 stimulates Sm
protein binding onto an artificial snRNA, termed SmII RNA, that consists of the Sm
site and stem/loop E of U1 snRNA and an artificial stem/loop 5' to the Sm site. The
capacity of this RNA to bind Sm protein is severely compromised, and, as a
consequence of this, its nuclear import is greatly reduced (Fischer et al., 1993, EMBO
J. 12:573-583). Employing the same injection strategy as described above, a strong
stimulation of Sm protein binding onto SmII RNA was observed after injection with
anti-SMN antibody 2B1, and, as a consequence, SmII was efficiently imported into the
nucleus. Taken together, and without wishing to be bound by theory, these data and
those presented in the preceding sections strongly suggest that both SIP1 and SMN are
directly involved in the assembly of the Sm core domain of spliceosomal U snRNPs.

Example 3: Specific sequences in SMN and SIP1 which mediate their 
interactions with each other and with Sm proteins and which are associated with 
defective interactions in SMA 

The experiments presented in this example may be summarized as follows.

Deletion mutants of SMN and SIP1 were prepared and used to identify 
the domains involved in the interactions between SMN and SIP1 as well as their 
interactions with their associated proteins such as the Sm proteins. Further, the effects 
of specific deletions and point mutations on the composition of the SMN complex and 
on the SMN and SIP1 cellular localization, with particular interest in targeting to gems, 
were analyzed. The data disclosed herein demonstrate that several SMN interactions 
are affected by mutations that occur in some SMA patients and that SMN 
oligomerization greatly enhances its interaction with Sm proteins. These results 
suggest a model of the SMN complex in which a SMN/SIP1 tetramer (or a higher 
oligomer) is the functional core required for efficient binding to Sm proteins, and thus 
snRNP assembly. These findings further strengthen the view that SMA is the result of
a defect in snRNP metabolism. 

The Materials and Methods used in the experiments presented in this 
example are now described. 

Plasmid Construction 

DNA fragments corresponding to the open reading frames (ORFs) of
SMN and SIP1 wild-type (wt) and mutant proteins were generated by polymerase
chain reaction (PCR) amplification using suitable primers. All of the myc-tagged
constructs were generated by cloning the PCR inserts into a modified pcDNA3 vector
(InVitrogen, Carlsbad, CA) downstream from the myc epitope recognized by the
monoclonal antibody 9E10 (Siomi and Dreyfuss, 1995, J. Cell. Biol. 129:551-560).
Plasmid constructs containing SMN fused to the carboxyl terminus of myc-pyruvate
kinase (PK) were obtained by cloning the SMN coding PCR fragments into a
myc-PK vector derived from pcDNA3 as previously described (Nakielny and Dreyfuss,
1996, J. Cell. Biol. 134:1365-1373). Maltose binding protein (MBP)-SMN fusions
were obtained by cloning the SMN coding PCR fragments into a modified pcDNAl
vector (InVitrogen, Carlsbad, CA) downstream from the MBP sequence. These
vectors facilitated expression driven by the CMV promoter in vivo and by the T7
promoter in vitro. All of the constructs were analyzed using DNA sequencing.

Production of proteins in vitro

The [35S]methionine-labeled proteins were produced by an in vitro
coupled transcription-translation reaction (Promega Corp., Madison, WI) in the
presence of [35S]methionine (Amersham, Arlington Heights, IL). His6-tagged SMN
and SmB fusion proteins were expressed from a pET28 bacterial expression system in
the E. coli strain BL21(DE3)pLysS, and the fusion proteins were purified by Ni2+ chelation
chromatography with the Novagen (Madison, WI) His-bind Buffer Kit following the
manufacturer's protocol. All the GST fusion proteins were expressed from the GST
expression vector pGEX-5X (Pharmacia) in the E. coli strain BL21(DE3)pLysS and
were purified using glutathione-Sepharose according to the manufacturer's protocol
(Pharmacia Biotech). SmB cDNA is described in Raker et al. (1996, EMBO J.
15:2256-2269). SMN and SIP1 cDNAs were obtained using a yeast di-hybrid
screening assay previously described herein.

In vitro protein-binding assay

Purified GST or GST fusion proteins (1-3 micrograms) were incubated
with 10^6 cpm of the in vitro translated protein product and 25 µl of
glutathione-Sepharose beads in 1 ml of binding buffer (50 mM Tris-HCl [pH 7.5], 200 mM NaCl, 2
mM EDTA, 0.1% NP-40, 2 micrograms per milliliter leupeptin and pepstatin A, and
0.5% aprotinin). After incubation for 1 hour at 4°C, the resin was pelleted, washed five
times with 1 ml of binding buffer, and the bound protein fraction was eluted by boiling
the washed beads in SDS-PAGE sample buffer. The eluted proteins were analyzed by
SDS-PAGE on a 12.5% polyacrylamide gel, and the radiolabel signal was enhanced by
treatment with Amplify solution (Amersham, Arlington Heights, IL).
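
For illustration, the volumes of concentrated stocks needed to make up 1 ml of the
binding buffer described above can be estimated with the usual dilution relation
C1 x V1 = C2 x V2. The sketch below is not part of the described method: the stock
concentrations (1 M Tris-HCl, 5 M NaCl, 0.5 M EDTA, 10% NP-40) and all names are
assumptions chosen for the example, and the protease inhibitors are omitted because no
stock concentrations can be inferred for them.

```python
# Hypothetical stock solutions; only the final concentrations come from the text.
# Each entry maps a component to (stock concentration, final concentration),
# with both values expressed in the same unit (mM, or percent for NP-40).
STOCKS = {
    "Tris-HCl pH 7.5 (mM)": (1000.0, 50.0),
    "NaCl (mM)": (5000.0, 200.0),
    "EDTA (mM)": (500.0, 2.0),
    "NP-40 (%)": (10.0, 0.1),
}


def stock_volume_ul(stock_conc, final_conc, total_ul):
    """Solve C1*V1 = C2*V2 for V1, the stock volume in microliters."""
    return final_conc * total_ul / stock_conc


def binding_buffer_recipe(total_ul=1000.0):
    """Return stock volumes (µl) plus the water needed to reach total_ul."""
    volumes = {
        name: stock_volume_ul(stock, final, total_ul)
        for name, (stock, final) in STOCKS.items()
    }
    volumes["water"] = total_ul - sum(volumes.values())
    return volumes


for component, microliters in binding_buffer_recipe().items():
    print(f"{component}: {microliters:.1f} µl")
```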

In the preincubation experiments, the indicated molar excess of purified
recombinant His-tagged SMN protein was incubated with GST or GST-SMN,
previously bound to glutathione-Sepharose beads, for 1 hour at 4°C in 1 ml of binding
buffer. Unbound proteins were removed by washing the beads five times with binding
buffer. The in vitro translated proteins were then added to the washed beads, and
binding was performed as described previously herein.

Gel-filtration Chromatography 

Purified recombinant His-tagged SMN, SMNY272C, or SMNΔEx7
(also referred to as SMNΔC16) (50 micrograms), and SmB (25 micrograms) were
incubated, individually or mixed as indicated, for 1 hour on ice in 0.25 milliliters of a
buffer containing 50 mM Hepes, pH 7.9, 400 mM KCl, 0.5 mM EDTA, 2.5 mM DTT.
The samples were then applied to a TSK-GEL G3000-SW glass column (08800;
TosoHaas, Montgomeryville, PA). The column was equilibrated in the same buffer as
mentioned previously herein. One minute fractions were collected at a 0.25 milliliter
per minute flow-rate. Selected fractions were pooled as indicated elsewhere herein,
and the samples were analyzed using SDS-PAGE and Western blotting using anti-T7
tag monoclonal antibody (Novagen, Madison, WI).
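
The fraction size implied by the collection scheme above follows from the flow rate
and the collection interval: one-minute fractions at 0.25 milliliters per minute are
0.25 milliliter each. The minimal sketch below makes that arithmetic explicit; the
function names are hypothetical, and the assumption that fraction 1 begins at sample
injection is introduced here for illustration only.

```python
def fraction_volume_ml(flow_ml_per_min, minutes_per_fraction):
    """Volume collected in one fraction."""
    return flow_ml_per_min * minutes_per_fraction


def elution_window_ml(fraction_number, flow_ml_per_min=0.25, minutes_per_fraction=1.0):
    """Return the (start, end) elution volume spanned by a 1-based fraction.

    Assumes, for illustration, that collection of fraction 1 starts at injection.
    """
    volume = fraction_volume_ml(flow_ml_per_min, minutes_per_fraction)
    return ((fraction_number - 1) * volume, fraction_number * volume)


print(fraction_volume_ml(0.25, 1.0))   # volume of each fraction in ml
print(elution_window_ml(10))           # elution-volume window of fraction 10
```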

Cell culture and Immunoprecipitation 

293T cells were cultured in Dulbecco's modified Eagle's medium
(DMEM) (GIBCO BRL, Gaithersburg, MD) supplemented with 10% fetal bovine
serum (FBS) (GIBCO BRL) and transfected by the standard calcium phosphate
procedure. At 36 to 48 hours posttransfection, the cells were collected and
processed for immunoprecipitation. Immunoprecipitations were performed using total
cell lysates prepared in the presence of 0.5% Triton X-100 as described previously in
Pinol-Roma et al. (1988, Genes Dev. 2:215-227).

Immunoblotting was performed as described previously elsewhere
herein. The antibodies used for these experiments were as follows: mouse monoclonal
anti-SIP1 (2E17, described previously elsewhere herein), mouse monoclonal anti-Sm
(Y12; Lerner et al., 1981, Proc. Natl. Acad. Sci. USA 78:2737-2741), mouse
monoclonal anti-myc (9E10; ATCC, Manassas, VA), and mouse monoclonal anti-T7
tag (Novagen, Madison, WI).

The Results of the experiments presented in this example are now described.

SMN mutations of SMA patients affect the direct interaction of SMN 
with itself and with SmB 

In an in vitro binding assay, purified recombinant His-tagged SMN and
SmB proteins bound to a GST-SMN fusion protein but not to GST alone (Figure 10A).
This demonstrates that SMN interacts directly with itself and, although with a low
affinity, with SmB, and rules out a possible bridging effect by other components such as
those that may be present in the reticulocyte lysate used in experiments disclosed
previously elsewhere herein and in, for example, Lorson et al. (1998, Nature Genet.
19:63-66). Next, the effect on these interactions of two well characterized mutations
found in SMA patients, the point mutant SMNY272C and the exon 7 deletion mutant
(SMNΔEx7), was examined. Figure 10B shows that both mutations severely affected
not only SMN self-association (Lorson et al., 1998, Nature Genet. 19:63-66) but also
the interaction of SMN with SmB. In contrast, no effect was observed on the
interaction of SMN with SIP1, which involves the amino terminus of SMN (as
disclosed previously elsewhere herein). GST-SIP1 binds equally efficiently to
full-length SMN and to both mutants. GST alone, used as a control, exhibited no
detectable binding to SMN. Because SMA is a motor neuron disease, the interaction of
the neuronal-specific Sm protein, SmN, with SMN wild type (SMNwt), SMNY272C,
and SMNΔEx7 was also analyzed, and the results were found to be identical to those
obtained with SmB.
These data demonstrate that the SMN self-association and SmB-binding domains share
common determinants within the YG box and that both SMN/SMN and SMN/SmB
interactions are affected by mutations that cause SMA. A schematic summary of
SMN-interacting domains is shown in Figure 15A.

SMN Self-Association Enhances the Interaction with Sm Proteins.

To determine whether or not these SMN/SMN and SMN/SIP1
interactions were mutually exclusive, beads containing GST-SMN, or GST as a
control, were preincubated with a molar excess of recombinant His-SMN to form SMN
oligomers. Then, after washing away the unbound SMN, in vitro translated,
[35S]methionine-labeled SMN, SmB, or SIP1 was added to the beads and binding of the
labeled proteins was assessed (Figure 11A). SMN binding was reduced only partially
by the preincubation with recombinant His-SMN, suggesting that the oligomerization
capacity of SMN on the beads had not been saturated. Surprisingly, SmB binding was
dramatically enhanced by SMN self-association. SIP1 binding was slightly increased,
presumably because additional binding sites became available with the bound
His-SMN. The specificity of this effect is demonstrated further by the lack of binding to
control GST-bearing beads. Further, it was determined whether this effect was
exhibited with other Sm proteins known to bind SMN (e.g., as discussed previously
elsewhere herein). Fig. 11B depicts that SMN self-association greatly stimulates its
interaction with SmB, SmD1, and SmD3 but not with SmD2 and SmE. SmF and SmG
do not bind SMN under any conditions examined. The binding efficiency of Sm
proteins to GST-SMN is lower than previously disclosed elsewhere herein (e.g.,
Example 1) because of the more stringent buffer conditions employed in this study.

Several lines of evidence argue against the possibility that the increased
binding of Sm proteins merely reflects the presence of additional interaction sites on
the bound His-SMN: (i) even at the highest concentration tested, the amount of bound
His-SMN is roughly equivalent to that of GST-SMN; (ii) consistently, SMN contains a
binding site for SIP1 independent of that for Sm proteins, and SIP1 binding is only
slightly increased by the addition of His-SMN (Figure 11A); and (iii) the effect of
SMN self-association is not observed with SmD2 and SmE, whose basal binding
otherwise is similar to that of SmB, SmD1, and SmD3 (Figure 11B). These data
indicate that SMN self-association and Sm protein interaction are not mutually
exclusive but, on the contrary, that SMN self-association very strongly and specifically
increases its affinity for a subset of Sm proteins.

A titration analysis of the stimulating effect of SMN oligomerization on
SmB binding is depicted in Figures 12A and 12B. The amount of His-SMN bound to
GST-SMN immobilized on glutathione-Sepharose beads, after the preincubation step,
was determined by Western blotting. The increase in SmB binding correlated with the
extent of SMN self-association (Fig. 12A). When the bound amount of wild-type
His-SMN was equivalent to the amount of GST-SMN on the beads, SmB binding increased
approximately 10-fold (Fig. 12A, lane 4X). SmB binding increased about 30-fold, and
more than about 60% of the input was bound, when the amount of His-SMN was
approximately 2.5-fold greater than GST-SMN (Fig. 12A, lane 8X). Without wishing
to be bound by theory, it appears that GST-SMN is predominantly in a monomeric
form, probably because the GST fusion or the interaction of the GST with the
glutathione-Sepharose interferes with the self-association of GST-SMN on the beads.
Proportional to their reduced ability to self-associate, a greater molar excess of
recombinant SMNY272C and SMNΔEx7 than wild-type SMN (SMNwt) was required
to obtain similar levels of association with GST-SMNwt. Although SMNY272C was
still able to stimulate SmB binding, SMNΔEx7 was not (Figure 12A). In the case of
SMNY272C self-association, the extent of stimulation of SmB binding was very low
(Figure 12B). SMNΔEx7 self-association was too inefficient to be analyzed. Thus,
SMN self-association most likely creates a high-affinity binding site for Sm proteins,
and SMN mutations found in SMA patients affect the ability of SMN to form the
Sm-binding site. Moreover, the binding site formed with the SMN mutants has a lower
affinity for Sm proteins than that formed by wild-type SMN.
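
As a rough consistency check on the titration figures reported above, the basal level of SmB binding can be back-calculated from the stated fold stimulation. The Python sketch below is an inference from the numbers in the text (about 30-fold stimulation with more than about 60% of input bound), not a measured value, and the function name is hypothetical.

```python
def implied_basal_percent(bound_percent_of_input: float, fold_increase: float) -> float:
    """Basal binding (percent of input) implied by a stimulated level and its fold increase."""
    if fold_increase <= 0:
        raise ValueError("fold_increase must be positive")
    return bound_percent_of_input / fold_increase


if __name__ == "__main__":
    # Values quoted in the text for lane 8X: about 30-fold, more than about 60% of input.
    basal = implied_basal_percent(60.0, 30.0)
    print(f"implied basal SmB binding: about {basal:.1f}% of input or more")
```

Under these assumptions, basal SmB binding to GST-SMN alone would be on the order of 2% of input, which is consistent with the low-affinity interaction described for monomeric SMN.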

SMN Oligomerization Is Impaired in Mutants of SMA Patients and Is 
Required for Binding to Sm Proteins. 

Although SMN self-associates and is part of a large, macromolecular 
complex in vivo, it could not be distinguished whether this is due to the presence of 
multiple copies of SMN and/or of additional proteins (see, e.g., Example 1). 
Moreover, previous in vitro experiments showing defective self-association of SMN 
mutants (Lorson et al., 1998, Nature Genet. 19:63-66) were performed under solid-state
conditions that did not allow a distinction between dimerization and oligomerization. 
Dimerization and oligomerization likely are different in terms of the interaction 
surfaces required for a protein to self-associate because oligomerization would involve 
at least two independent binding sites. Similarly, the data disclosed herein strongly 
suggest that SMN oligomerization enhances the interaction of SMN with Sm proteins, 
but no direct evidence that SMN indeed can oligomerize by itself had been provided so 
far. 

The ability of SMN to oligomerize was assessed using gel-filtration
chromatography of purified recombinant His-tagged SMN wild-type and mutant
proteins. Figure 13A depicts that SMN alone was able to form large oligomers of up to
a molecular mass corresponding to approximately 500 kDa. In contrast, mutants
SMNY272C and SMNΔEx7 were severely impaired in their ability to form oligomers.
Next, the predicted requirement of SMN oligomerization for Sm protein interaction
was examined. As depicted in Figure 13B, SmB associated with SMN large oligomers
in the high-molecular-weight fractions. In contrast, no association between SMNΔEx7
and SmB as larger-size complexes could be detected. These results provide direct
evidence that SMN is able to oligomerize and is found almost exclusively
self-associated in large oligomeric complexes. Furthermore, SMN mutations found in
SMA patients disrupt such oligomerization and the interaction with Sm proteins.

Reduced Association of SMN Mutants with snRNPs in Vivo.

The data disclosed previously herein suggest that SMN mutants found
in vivo such as SMNY272C and SMNΔEx7 would associate with Sm proteins less
efficiently than SMN wild type. To test this hypothesis, 293T cells were transfected
with myc-tagged SMN wild type, SMNY272C, or SMNΔEx7. All the myc-tagged
transfected proteins were expressed at similar levels as determined by Western
blotting using antibodies against the myc tag (Figure 14). By
coimmunoprecipitation with anti-SIP1 antibodies, comparable levels of wild-type and
mutant proteins were detected associated with SIP1 as a SMN/SIP1 complex.
However, immunoprecipitation using the anti-Sm mAb Y12 demonstrated that the
association of SMNY272C and SMNΔEx7 with Sm proteins is reduced markedly
compared with that of SMNwt. A complex comprising the SMN mutants
and Sm proteins was still detected because the reduced ability of the mutants to form
oligomers with the wild-type SMN is partially overcome by the overexpression and by
the possible contribution of other SIPs in vivo (e.g., Example 1).

Figure 15B depicts a summary model of several interactions in the SMN
complex. SMN appears to be associated with SIP1 most, if not all, of the time (see,
e.g., Example 1). Moreover, SIP1 interaction with SMN may have an effect on SMN
oligomerization or Sm protein interaction. Monomeric SMN has only a low affinity
for Sm proteins because a high-affinity Sm-binding domain forms only upon SMN
oligomerization. For simplicity, SMN has been depicted bound to Sm proteins as a
dimer; however, the actual stoichiometry of the SMN oligomers is not yet known.
Specific protein-protein interactions between the various Sm proteins are required for
the ordered assembly of the Sm core (Raker et al., 1996, EMBO J. 15:2256-2269). The
data disclosed herein demonstrate that SMN is able to form large oligomers, and, in
such a conformation, it binds with high affinity to a subset of Sm proteins. Without
wishing to be bound by any particular theory, it may be that the SMN oligomer is the
functional core that allows the SMN complex to function in snRNP assembly (e.g.,
Example 2) and spliceosome regeneration (e.g., Example 4). Importantly, SMN
mutations found in SMA patients directly affect SMN oligomerization and Sm protein
binding. Thus, the loss-of-function phenotype of mutant proteins such as SMNY272C
and SMNΔEx7 in pre-mRNA splicing, as disclosed elsewhere herein, is most likely the
direct result of an impaired interaction with the Sm proteins. These findings directly
link the molecular mechanism of SMA to a deficiency in the interaction of SMN with
spliceosomal snRNP Sm proteins. A detailed knowledge of the structure of the
SMN/SIP1 complex with Sm proteins will lead to further insights into the mechanisms
of SMN function and suggest possible therapeutic approaches for SMA.

Example 4: Novel function of SMN in pre-mRNA splicing 

The experiments presented in this example may be summarized as
follows.

As discussed previously elsewhere herein, SMA is a common motor
neuron degenerative disease that results from reduced levels of, or mutations in, the
Survival of Motor Neurons (SMN) protein. SMN is found in the cytoplasm and the
nucleus where it is concentrated in gems. SMN interacts with spliceosomal snRNP
proteins and is critical for snRNP assembly in the cytoplasm. The data disclosed
herein demonstrate that a dominant-negative mutant SMN (SMNΔN27) causes a
dramatic reorganization of snRNPs in the nucleus. Furthermore, SMNΔN27 inhibits
pre-mRNA splicing in vitro, while wild-type SMN stimulates splicing. SMN mutants
found in SMA patients cannot stimulate splicing. These data demonstrate that SMN
plays a crucial role in the generation of the pre-mRNA splicing machinery and thus in
mRNA biogenesis, and the data link the function of SMN in this pathway to SMA.

The Materials and Methods used in the experiments presented in this
example are now described.

Constructs and recombinant protein production

DNA fragments corresponding to the open reading frames of SMN wild
type (SMNwt), SMNY272C, SMNΔEx7, and SMNΔN27 were generated by PCR
amplification using specific primers. In order to effect transient expression of the
inserts in HeLa cells, the inserts were cloned downstream of the CMV promoter into a
modified pcDNA3 vector (Invitrogen, Carlsbad, CA) further containing the myc-tag
reporter protein sequence corresponding to the epitope recognized by the monoclonal
antibody 9E10 (Siomi and Dreyfuss, 1995, J. Cell Biol. 129:551-560).

For the production of purified recombinant proteins, the same inserts
cloned into pcDNA3 were also cloned into the pET28 vector (Novagen Inc., Madison,
WI). His6-SMN fusion proteins were expressed in the E. coli strain BL21(DE3)pLysS
and the proteins were purified by Ni2+ chelation chromatography using the Novagen
(Madison, WI) His-bind Buffer Kit following the manufacturer's protocol. Purified
recombinant proteins were dialyzed against buffer D as described by Dignam et al.
(1983, Nucl. Acids Res. 11:1475-1489).

The Ad-2 ΔIVS construct, containing the L1 exon (41 nucleotides), a
deleted form of the first intervening sequence (IVS1, 231 nucleotides), and the L2 exon (72
nucleotides) of the adenovirus 2 (Ad-2) major late transcription unit, was generated by
PCR amplification using pRSP-1-ΔIVS as a template (Konarska et al., 1984, Cell
38:731-736), and the construct was cloned into pcDNA3 (Invitrogen, Carlsbad, CA).
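
The segment lengths given above determine the expected sizes of the splicing substrate and products derived from the adenoviral sequence. The following Python sketch is a minimal illustration, assuming that only the three stated segments contribute; any vector-derived sequence flanking the insert in the actual transcript is not accounted for, and the class and attribute names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MiniGene:
    """Two-exon splicing substrate described by its segment lengths in nucleotides."""
    exon1_nt: int
    intron_nt: int
    exon2_nt: int

    @property
    def pre_mrna_nt(self) -> int:
        # Unspliced substrate: exon 1 + intron + exon 2
        return self.exon1_nt + self.intron_nt + self.exon2_nt

    @property
    def spliced_nt(self) -> int:
        # Ligated exons after intron removal
        return self.exon1_nt + self.exon2_nt


# Lengths as stated in the text for the L1 exon, deleted IVS1, and L2 exon.
AD2_DELTA_IVS = MiniGene(exon1_nt=41, intron_nt=231, exon2_nt=72)

if __name__ == "__main__":
    print("pre-mRNA:", AD2_DELTA_IVS.pre_mrna_nt, "nt")
    print("spliced mRNA:", AD2_DELTA_IVS.spliced_nt, "nt")
    print("excised intron:", AD2_DELTA_IVS.intron_nt, "nt")
```

These sizes may help in assigning bands on the denaturing gels used to analyze the splicing reactions, subject to the flanking-sequence caveat noted above.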

Cell culture and treatments 

HeLa cells were cultured in Dulbecco's modified Eagle's medium 
(DMEM, GIBCO-BRL, Gaithersburg, MD) supplemented with 10% fetal calf serum 
(FCS; GIBCO-BRL). 

HeLa cells, plated on glass coverslips, were transfected using the 
standard calcium phosphate method. Following overnight incubation with DNA, the 
cells were washed and fresh medium was added. Transfected cells were then fixed and 
processed for immunofluorescence staining after an additional 24-36 hours of 
incubation as set forth previously herein. 

Immunofluorescence microscopy 

Immunofluorescence staining was carried out essentially as previously
described (Choi and Dreyfuss, 1984, J. Cell. Biol. 99:1997-2004). Double-label
immunofluorescence experiments were performed by separate sequential incubations
of each primary antibody, diluted 1:1000 in PBS containing 3% (w/v) BSA, followed
by the specific secondary antibody coupled to fluorescein isothiocyanate (FITC) or Texas Red
(TXRD). All antibody incubations were performed at room temperature for 1 hour.
Laser confocal fluorescence microscopy was performed using a Leica TCS 4D
(Germany) confocal microscope. Images from each channel were recorded separately
and then the data files were merged. Antibodies used in these experiments were as
follows: rabbit polyserum anti-p80 coilin (R288, described in Andrade et al., 1993,
Proc. Natl. Acad. Sci. USA 90:1947-1951), mouse IgG1 monoclonal anti-SMN (2B1),
mouse IgG1 monoclonal anti-TMG cap (K121; Krainer, 1988, Nucleic Acids Res.
16:9415-9429) (Calbiochem, San Diego, CA), mouse IgG3 monoclonal anti-Sm (Y12;
Lerner et al., 1981, Proc. Natl. Acad. Sci. USA 78:2737-2741), mouse IgG1
monoclonal anti-myc (9E10), and affinity purified rabbit polyserum anti-myc (A-14,
obtained from Santa Cruz Biotech, Santa Cruz, CA).

In situ hybridization was performed following the protocol described by
Matera and Ward (1993, J. Cell Biol. 121:715-727). Briefly, the oligonucleotide probe
was a biotinylated 2'-OMe RNA complementary to U2 snRNA region 30-43
(Wassarman and Steitz, 1991, Mol. Cell. Biol. 11:3432-3445) and the biotinylated
probe was detected using streptavidin directly coupled to TXRD (GIBCO-BRL).

In vitro transcription and splicing assay 

For use as templates for in vitro transcription, pSP14-15 (Ohno and
Shimura, 1996, Genes & Dev. 10:997-1007) and Ad-2 ΔIVS were linearized with
SmaI and XbaI, respectively. In vitro transcription was carried out and transcribed
RNAs were purified as described by Ohno and Shimura (1996, Genes & Dev.
10:997-1007). In vitro splicing assays were performed in 10 µl reactions essentially as
described (Ohno and Shimura, supra), except that the amount of HeLa cell nuclear extract
used was reduced by half (13 milligrams per milliliter) in the reaction mixture. In the
pre-incubation experiments, the splicing mixture was incubated with the indicated amount
of either SMNwt, SMNY272C, SMNΔEx7, or SMNΔN27 recombinant protein for 20 minutes
at 30°C. Pre-mRNA was added and the mixture was incubated for an additional 20 or
40 minutes at 30°C. For antibody inhibition experiments, all antibodies were purified
using a protein-G Sepharose column (Pharmacia LKB Biotechnology Inc., Piscataway,
NJ) and then dialyzed against buffer D (Dignam et al., 1983, Nucl. Acids Res.
11:1475-1489). Purified antibodies were incubated with HeLa cell nuclear extract and
10X SP buffer (Ohno and Shimura, 1996, Genes & Dev. 10:997-1007) for 20 minutes
at 30°C. After addition of labeled pre-mRNA, the mixture was incubated for an
additional 20 or 40 minutes at 30°C. RNA products were analyzed by electrophoresis
on a 6% acrylamide/8.3 M urea gel for pSP14-15 RNA, and on a 10%
acrylamide/8.3 M urea gel for Ad-2 ΔIVS RNA, followed by autoradiography.

Analysis of splicing complex formation by RNP gel electrophoresis

For analysis of splicing complexes, splicing reactions were carried out
as described previously herein using α-32P-labeled chicken δ-crystallin pre-mRNA.
After a 30 minute incubation at 30°C, 50 micrograms of heparin (Sigma) were added
to each reaction and the reaction was placed on ice for 10 minutes. RNP native gel
electrophoresis on a 3.75% polyacrylamide gel was performed as described in
Konarska (1989, Methods Enzymol. 180:442-453).

The Results of the experiments presented in this example are now
described.

Dominant-negative SMN mutant causes reorganization of snRNPs,
gems, and coiled bodies

To define the functional domains of SMN, various deletion mutants
were constructed and their interactions with SIP1 and the Sm proteins were examined
both in vitro by binding assays, and in vivo by co-immunoprecipitation after
transfection into mammalian cells. Moreover, the expression and cellular localization
of the myc-tagged mutants were monitored, as well as their effect on snRNP
localization. A particularly striking effect was observed upon transfection of amino
terminal deletion mutants, and one of these, a mutant lacking the first 27 amino acids
of SMN, SMNΔN27, was investigated in detail.

Double-label immunofluorescence using anti-myc-tag antibodies to
detect either the transfected myc-SMN or myc-SMNΔN27, and the anti-Sm antibody
Y12, demonstrated accumulation in the cytoplasm of Sm proteins colocalized with the
mutant SMNΔN27 in discrete aggregates (Figure 16A). In the nucleus, a striking
rearrangement of snRNPs (and possibly also some Sm proteins) was observed and they
also co-localized with SMNΔN27. In contrast, in cells transfected with wild-type
myc-tagged SMN (Figures 16A and 16C), as in untransfected cells, there was a barely
detectable signal of Sm proteins in the cytoplasm, and the nuclear staining of Sm
snRNPs demonstrated general nucleoplasmic distribution with higher local
concentrations in interchromatin granules (also referred to as "speckles"), and
particularly intense staining in coiled bodies (Carmo-Fonseca et al., 1991, EMBO J.
10:195-206; Huang and Spector, 1992, Proc. Natl. Acad. Sci. USA 89:305-308). In
these cells, SMN staining of gems was visible adjacent to and in most cases
overlapping with that of coiled bodies (Figure 16C). There was a higher diffuse
nucleoplasmic staining of myc-SMN than is normally seen in untransfected cells, likely
due to the overexpression of the protein, but the size, number, and localization of gems
were unchanged relative to untransfected cells.

In the SMNΔN27-transfected cells, endogenous SMN and SIP1 were
completely co-localized with SMNΔN27 and with snRNPs. The number of the merged
SMN- and snRNP-containing bodies in the SMNΔN27-transfected cells was greater
than the number of gems (or of coiled bodies) in control cells, and they were much
larger than gems and coiled bodies. These merged structures could become as large as
the nucleoli.

The localization of a specific spliceosomal snRNA in cells transfected
with SMNΔN27 was also studied. To do so, the localization of U2 snRNA was
determined by in situ hybridization using a U2-specific antisense probe as described by
Matera and Ward (1993, J. Cell Biol. 121:715-727). Figures 17A and 17C demonstrate
that the cytoplasmic accumulations that stained intensely for both the SMNΔN27 and
Sm proteins also contained U2 snRNA, while cells transfected with wild-type SMN
exhibited only slight cytoplasmic background staining. This suggests that complexes
containing Sm proteins, snRNAs, and SMN accumulated in the cytoplasm of
SMNΔN27 but not SMNwt cells, likely representing a block in the pathway of
cytoplasmic snRNP assembly.

This block in cytoplasmic snRNP assembly was examined further by
immunostaining using an anti-trimethyl-G cap (TMG)-specific antibody. The TMG cap
is formed by hypermethylation of the 5' monomethyl G cap of the snRNAs, and this
step has been shown to take place in the cytoplasm after Sm core assembly has
occurred (Mattaj, 1986, Cell 46:905-911; Fischer and Luhrmann, 1990, Science
249:786-790). The immunostaining indicated that although the cytoplasmic
accumulations caused by SMNΔN27 contained U2 as well as Sm proteins and SMN,
the snRNAs within them did not receive the TMG modification (Figures 17D and 17F).
Although the possibility that lack of TMG immunostaining in the cytoplasm is a result
of masking cannot be excluded, the complete absence of signal and the strong staining
in the nucleus make this unlikely. Thus, without wishing to be bound by theory, it
appears that the snRNP assembly pathway in the cytoplasm is arrested at a step
preceding the cap hypermethylation. The snRNAs in the nuclei of the same cells,
however, are hypermethylated, since they stain efficiently with the anti-TMG antibody.
These effects of SMNΔN27 suggest that the construct has a dominant-negative
phenotype over wild type SMN. These data also suggest that SMN and SIP1 (i.e.,
Gemin2) interact with Sm proteins in the cytoplasm, and, importantly, also with
snRNPs in the nucleus. Further, these data demonstrate that mutations in SMN can
have profound effects on both the biogenesis and localization of spliceosomal snRNPs
in the cell.

Gems, the structures in which SMN and SIP1 are most highly
concentrated in the nucleus, are usually found adjacent to and often merged with coiled
bodies (Liu and Dreyfuss, 1996, EMBO J. 15:3555-3565; Liu et al., 1997, Cell
90:1013-1021). Therefore, it was determined whether the SMN mutant, SMNΔN27,
had any effect on the structure or organization of coiled bodies by staining the cells
with antibodies to p80 coilin, a coiled body-specific marker (Andrade et al., 1991, J.
Exp. Med. 173:1407-1419). The immunomicrographs demonstrate
that coiled bodies and gems became completely merged in the nucleus and that the p80
coilin staining completely coincided with the staining of the enlarged gems (Figure
18A-F). These enlarged merged nuclear structures therefore contained snRNPs, coiled
body components, and gem components. The reorganization of coiled bodies that SMNΔN27
brings about suggests a functional relationship between SMN and coiled bodies.

SMNΔN27 inhibits pre-mRNA splicing when added during
preincubation

The profound effect of SMN on the organization of nuclear
snRNPs, particularly the formation of large snRNP-containing aggregates which are
similar in appearance to those observed in cells in which snRNPs are inactivated by
antisense oligonucleotides (O'Keefe et al., 1994, J. Cell Biol. 124:249-260) or when
transcription is inhibited by actinomycin D (Zeng et al., 1997, EMBO J. 16:1401-1412),
suggests that SMNΔN27 can cause an inactivation of snRNPs and, possibly, of
other pre-mRNA splicing factors. To address this directly, the effect of SMNΔN27 on
pre-mRNA splicing was examined in an in vitro system.

SMNwt and SMNΔN27 were produced in bacteria as recombinant
proteins bearing a His-tag; the fusion proteins were purified to homogeneity and then
added to nuclear splicing extracts. 32P-labeled chicken δ-crystallin
pre-mRNA was used as a splicing probe and the reaction products were analyzed by gel
electrophoresis. Addition of SMNwt or the SMNΔN27 mutant fusion protein to the
reaction at time zero had almost no effect on the splicing reaction, although both fusion
proteins appeared to stabilize the intron (Figure 19A).

It was next determined whether SMN, while not a splicing factor per se,
may be important for some function akin to its role in snRNP assembly in the
cytoplasm. That is, although nuclear snRNPs are thought to be stable and fully
assembled, they may need to be regenerated or reassembled between rounds of
splicing, and SMN may be required for this process. To address this possibility, the
splicing extract was allowed to run in the presence of SMNΔN27 before the labeled
pre-mRNA probe was added. Therefore, the extract was pre-incubated with all the
components, including an energy generating system and SMNwt or SMNΔN27, but
without the pre-mRNA for 20 minutes, at which time the δ-crystallin pre-mRNA
probe was added and the reaction was allowed to proceed for an
additional 20 or 40 minutes. A strong inhibition of splicing was
observed only in the samples to which the SMNΔN27 was added during
the pre-incubation period (Figure 19B). Identical amounts of SMNwt
and of SMNΔN27 were used in the splicing reactions, and the inhibition
demonstrated in samples preincubated with SMNΔN27 was
concentration-dependent. About a three- to five-fold mass excess of
SMNΔN27 over endogenous SMN was sufficient for complete inhibition
of δ-crystallin pre-mRNA splicing.

SMNΔN27 blocks the formation of spliceosomal complex C and
SMNwt stimulates spliceosome formation when added during preincubation

The observation that preincubation with SMNΔN27 causes inhibition of
splicing, including a block to the first step of the pre-mRNA reaction, cleavage at the 5'
splice site, and lariat intron formation, indicated a block to an early step in
spliceosome formation. In order to characterize this block in greater detail and to
determine whether the snRNPs could interact with the pre-mRNA at all, the RNP
complexes were analyzed by native gel electrophoresis (Konarska, 1989, Methods
Enzymol. 180:442-453). Immediately upon addition to a nuclear extract, the
pre-mRNA rapidly forms an ATP-independent complex with endogenous RNA-binding
hnRNP proteins termed H complex (Figure 20, lane 1). With further incubation at
30°C in the presence of ATP, several larger complexes are observed (Figure 20, lane
2), which are designated A, B, and C complex (Figure 23; see also Ohno and Shimura,
1996, Genes & Dev. 10:997-1007). Consistent with the notion that allowing the
extract to preincubate under splicing conditions leads to a reduction in active
components, the capacity of the extract to form splicing complexes was significantly
reduced during a 20 minute preincubation (Figure 20, lane 3). However, SMNwt
prevented this reduction so that the extract retained and in fact exhibited increased
efficiency of splicing complex formation (Figure 20, lane 4). In contrast, although
extracts to which SMNΔN27 has been added show a higher capacity to form A and B
complexes compared with untreated extract, the formation of the C complex, which is
the mature spliceosome, is completely inhibited (Figure 20, lane 5). These results
suggest that SMN has an important role in maintaining snRNPs in active form and that
the amino-terminal 27 amino acids of the protein are critical for this process.

Antibodies to the amino terminus of SMN inhibit and antibodies to the
carboxyl terminus stimulate pre-mRNA splicing when added during preincubation

Amino terminal deletions in the SMN gene, or other dominant-negative
mutations in SMN, have not been found so far in SMA patients, who, in the vast
majority of cases, exhibit only a dramatic reduction in the level of the wild-type SMN
protein. To mimic this situation and to address the role of SMN in pre-mRNA splicing,
the available amount of the SMN protein was reduced without the addition of an SMN
mutant.

SMN is tightly associated with SIP1, this interaction being resistant to
dissociation with 1 M NaCl, and SMN is also associated with other proteins, thus
precluding the use of immunodepletion as a way of removing exclusively SMN from
the extract. Therefore, purified anti-SMN monoclonal antibody (2B1) was added to the
extract to determine whether anti-SMN could cause specific immunoinhibition of
pre-mRNA splicing. The results, shown in Figure 21, demonstrate that 2B1 strongly
inhibited splicing but, again, only if it was added during the pre-incubation period. The
lack of inhibition seen when the antibody was added at the same time as the probe
pre-mRNA indicated that 2B1 was not simply toxic to the reaction. Consistent with the
dominant-negative effect of SMNΔN27, the first 27 amino acids of SMN are part of
the epitope recognized by 2B1. The specificity of the 2B1 effect was illustrated by the
lack of inhibition by the control antibody SP2/0, and further by the lack of inhibition
by an affinity purified rabbit polyclonal antibody raised against an SMN peptide
corresponding to the extreme C-terminus encoded by exon 7. In fact, the anti-exon 7
antibody had a strong stimulatory effect. Further, the data demonstrate that anti-SMN
(2B1) was as strong an inhibitor of pre-mRNA splicing as the anti-snRNP antibody
Y12 (Figure 22; Padgett et al., 1983, Cell 35:101-107).

Unlike SMNwt, SMN mutants found in SMA patients do not stimulate
splicing when added during preincubation

In addition to the homozygous deletion of both telomeric SMN genes,
several point mutations and partial deletions of the carboxyl terminus of SMN have
been identified in SMA patients as reviewed in Burghes (1997, Am. J. Hum. Genet.
61:9-15). The effects of two mutations leading to the severe type I SMA phenotype,
the Y272C point mutation and the deletion of the amino acid sequence encoded by
exon 7 (Lefebvre et al., 1995, Cell 89:155-165; Burghes, 1997, Am. J. Hum. Genet.
61:9-15), have been investigated. SMN deleted of exon 7 (designated SMNΔEx7) is
also thought to be the main form produced by the centromeric SMN gene (Gennarelli et
al., 1995, Biochem. Biophys. Res. Commun. 213:342-348). These loss-of-function
mutations of SMN are recessive and result in the SMA phenotype in individuals
carrying a deletion of the other telomeric SMN allele. As these are recessive mutations,
they were not expected to produce the inhibitory effects of SMNΔN27. Nonetheless, it
was determined whether these recessive mutations could substitute for SMNwt in
providing a stimulatory effect during preincubation. These experiments were
performed using the adenovirus 2 major late transcription unit-derived pre-mRNA,
Ad-2 ΔIVS (Konarska et al., 1984, Cell 38:731-736), rather than the δ-crystallin
pre-mRNA, in order to determine also whether the inhibitory effect of
SMNΔN27 was a general phenomenon, as opposed to an effect that is unique to
δ-crystallin pre-mRNA.

Similar to its effect on δ-crystallin pre-mRNA splicing, addition of
SMNΔN27 during the preincubation time inhibited the splicing of Ad-2 ΔIVS
pre-mRNA (Figure 22). In contrast, addition of the wild-type SMN had a strong
stimulatory effect on adenovirus pre-mRNA splicing. Neither SMNY272C nor
SMNΔEx7 had any detectable effect, demonstrating neither stimulation nor inhibition
of pre-mRNA splicing. The recombinant proteins used in these experiments were
produced side-by-side using the same procedure, and identical amounts were added to
each assay. All these SMN recombinant proteins were able to directly bind a
GST-SIP1 fusion protein in vitro, further suggesting that the recombinant proteins were
purified in native form. These findings demonstrate that SMN has an important and
unexpected general role in pre-mRNA splicing, and these data demonstrate a direct
connection between the molecular defect of SMA and the pre-mRNA splicing cycle.

The data disclosed herein demonstrate that SMN has functions both in
the cytoplasm and in the nucleus, in both snRNP biogenesis and function.
Experiments in Xenopus oocytes disclosed previously elsewhere herein demonstrated a
role for SIP1 in the biogenesis of snRNPs by serving as an assembly factor for the
snRNAs and the snRNP Sm core proteins. Those experiments, however, did not reveal
as clear and direct a role for SMN as for SIP1, because while microinjections of
anti-SIP1 antibodies strongly inhibited snRNP assembly, the anti-SMN antibodies had a
stimulatory effect (see Example 2). The experiments in somatic cells reported herein
demonstrate that transfection of an SMN mutant blocks snRNP assembly in the
cytoplasm, indicating that SMN, like SIP1, is critical for this process. The data
disclosed herein further demonstrate that the requirement for SMN in snRNP assembly
is a general one and not a phenomenon that is unique to amphibian oocytes.

Although the presence of SMN and SIP1 in the nucleus and their high
concentration in gems, adjacent to and often merged with the snRNP-rich coiled
bodies, suggested a function for these proteins in the activity of snRNPs in the nucleus,
the strong effect of SMNΔN27 on nuclear snRNP organization was unexpected. As
this raised the possibility of an involvement of these proteins, particularly of SMN, in
the activity of snRNPs in pre-mRNA splicing, this possibility was examined. The data
disclosed herein point to a novel function of SMN in the pre-mRNA splicing cycle and
suggest that some components of the spliceosome, likely snRNPs but possibly also
other components, require SMN for their function. The lack of an effect of SMNΔN27
upon its addition to a splicing extract indicates that SMN is neither a general inhibitor
of splicing nor a splicing factor per se. Rather, inhibition of splicing by SMNΔN27
and by the anti-SMN antibody 2B1 is seen only if the mutant or the antibody is
added to the extract and the reaction is preincubated before the pre-mRNA probe is
added. This inhibition, which is not observed with wild-type SMN and which is
general rather than pre-mRNA specific, suggests a role for SMN in regeneration of
snRNPs (and possibly also of other components). Without wishing to be bound by any
particular theory, the data disclosed herein suggest that during the incubation of the
nuclear extract under splicing conditions, including an energy-regenerating system,
some of its components, likely including snRNPs, become inactivated and their
regeneration to functional form requires SMN. Indeed, addition of recombinant
SMNwt greatly stimulates both spliceosome formation and splicing. In splicing-competent
extracts, endogenous SMN is sufficient to provide this activity, but added
SMNΔN27 blocks this regeneration process and acts as a dominant-negative
mutant of SMN because it is able to engage in some of the same interactions
as SMN but traps complexes in a nonfunctional state. The
interactions of SMNΔN27 with Sm proteins, with SIP1, and with SMN are
indistinguishable from those of wild-type SMN. SMN mutations found in SMA patients
do not show either a stimulation or an inhibition of splicing, confirming that they
represent loss-of-function recessive mutants. Since SMNY272C and SMNΔEx7 have
a highly reduced oligomerization capacity (Lorson et al., 1998, Nature Genet.
19:63-66), SMN oligomerization may be required for its splicing-regenerating activity. It
should be noted that although the general term "regeneration" is used to describe this
function of SMN, the actual mechanism involved is not yet known and the present
invention is not limited to any particular mechanism of SMN action.

Several recycling factors that are essential for splicing, mostly
DEAD/DEAH box RNA helicases, have been described previously. Recycling factors,
exemplified by S. cerevisiae Prp22 and Prp43, function in the disassembly of snRNPs,
splicing factors, intron lariat, and spliced mRNA from the spliceosome (Staley and
Guthrie, 1998, Cell 92:315-326). For example, Prp22 is needed for releasing the
mRNA from the spliceosome, and therefore yeast cells that are made deficient in
functional Prp22 accumulate mRNA on spliceosomes in the nucleus, but there is no
block in pre-mRNA splicing (Company et al., 1991, Nature 349:487-493). Similarly,
deficiency in Prp43, a recycling factor required for disassembly of the U snRNP-intron
lariat complex, does not result in inhibition of splicing (Arenas and Abelson, 1997,
Proc. Natl. Acad. Sci. USA 94:11798-11802). Splicing extracts preincubated with
SMNΔN27 do not show any cleavage of the pre-mRNA, indicating that even the first
step in the splicing reaction, i.e., the cleavage at the 5' splice site and the concomitant
formation of the intron lariat, has not taken place. This is somewhat similar to what is
seen upon depletion of Prp24, the recycling factor required for re-annealing of U4 and
U6 snRNPs (Raghunathan and Guthrie, 1998, Science 279:857-860). As disclosed
herein, the RNP gel analysis suggests that there is a deficiency in functional
components that are required for spliceosome complex C formation. Future
experiments will characterize the block to C complex formation and attempt to define
the specific defect that results from incubation of the extract with SMNΔN27 or from
reduced levels of SMN. Further, unlike recycling factors described so far, SMN and
SIP1 do not contain DEAD/DEAH motifs. However, as disclosed previously
elsewhere herein, additional proteins are found in the SMN/SIP1 complex, and it is
possible that one of these proteins has such an activity. Indeed, the data disclosed
elsewhere herein demonstrate that a novel SMN-interacting protein, Gemin3, contains
a DEAD/DEAH motif (see Example 6).

Without wishing to be bound by any particular theory, the data
disclosed herein demonstrate that the factors requiring regeneration, presumably
snRNPs, must exist in two states, inactive and active, and that the conversion of
inactive to active forms requires SMN. The conversion of active to inactive may be a
consequence of their function in splicing or it may be an intrinsic switch that they
undergo independent of splicing. By analogy to the function of SMN and SIP1 in the
cytoplasm, it may be that snRNPs in the nucleus, although thought of as stable, fully
assembled, and functional RNPs, in fact undergo some disassembly or rearrangements,
and SMN and SIP1 are required for their reassembly to regenerate them into functional
form. A scheme depicting this proposed view of the place of SMN in the pre-mRNA
splicing cycle is shown in Figure 23. The nuclear functions of SIP1 are not yet known
and are being investigated.

Coiled bodies were first described in 1903 by Ramon y Cajal, who
observed them in neuronal cells and named them nucleolar accessory bodies (Ramon y
Cajal, 1903, Trab. Lab. Invest. Biol. 2:129-221). Over the past several years, coiled
bodies have received much renewed interest as they have been found to contain the
highest concentration of snRNPs in the nucleus, and additional data have raised the
possibility that they have important, albeit not clearly defined, roles in the formation of
both splicing components and the nucleolus (Raska et al., 1990, J. Struct. Biol.
104:120-127; Lamond and Carmo-Fonseca, 1993, Trends Cell Biol. 3:198-204;
Bohmann et al., 1995, J. Cell Biol. 131:817-831; Roth, 1995, Curr. Opin. Cell Biol.
7:325-328; Lamond and Earnshaw, 1998, Science 280:547-553). The effect of
SMNΔN27 on coiled bodies provides insight into the function of both gems and coiled
bodies. It indicates that there is a functional relationship between these two structures
and, in particular, that SMN affects the organization, and likely the function, of coiled
bodies. It is possible that gems and coiled bodies represent two stations in a pathway
along which snRNPs need to proceed to become functional. At this stage, the order of
steps between these stations cannot be assigned. The data disclosed herein directly link
the functions of coiled bodies and gems to pre-mRNA splicing components,
specifically to a regeneration of snRNPs.

The functions disclosed for the first time herein for SMN and SIP1
define them as critical proteins for the generation of the splicing machinery and thus
ultimately for the process of mRNA biogenesis. Consistent with a housekeeping
function for SMN, SMN knockout mice display an early embryonic lethal phenotype
(Schrank et al., 1997, Proc. Natl. Acad. Sci. USA 94:9920-9925). Most SMA patients,
particularly those with the fatal infant form, SMA type I, have a drastic reduction in the
amount of SMN in motor neurons (Lefebvre et al., 1997, Nature Genet. 16:265-269).
In some cases, SMA patients produce a protein bearing a recessive mutation like the
SMNY272C and SMNΔEx7 mutations analyzed here. It is therefore important that, as
disclosed herein, these mutants do not have the capacity to provide the activity of
wild-type SMN in splicing. Together, the novel findings presented here strongly suggest
that motor neurons of SMA patients are impaired in their capacity to produce mRNAs
and, as a result, become deficient in proteins that are necessary for the growth and
functions of these cells.

In conclusion, SMA is a human splicing disease, and better knowledge
of the precise molecular details of the mechanism of action of SMN in this process will
likely facilitate the search for a therapeutic approach to this devastating disease.
Although SMA is considered to be an autosomal recessive disease, the phenotype of
SMNΔN27 demonstrates that some mutations in SMN will have a dominant-negative
lethal phenotype.

Example 5: Gemin3: A novel DEAD box protein that is a component
of gems and that interacts with SMN

The experiments presented in this example may be summarized as
follows.

The data disclosed herein demonstrate the cloning and characterization
of a heretofore unknown protein that is a component of the SMN complex, a novel
DEAD box putative RNA helicase, designated "Gemin3" (for protein component of
gems number 3). Gemin3 interacts directly with SMN as well as with SmB, SmD2
and SmD3. Immunolocalization studies using monoclonal antibodies to Gemin3
demonstrated that Gemin3 co-localizes with SMN in gems. Gemin3 binds to SMN
via its unique C-terminal domain, and SMN mutations found in some SMA patients
strongly reduce this interaction. Unlike SMN and SIP1, which do not contain
DEAD/DEAH motifs (reviewed in Staley and Guthrie, 1998, Cell 92:315-326),
Gemin3 contains a DEAD box motif, indicating that the protein is a putative helicase.
Thus, the presence of a DEAD motif in Gemin3 suggests that this protein may provide
a catalytic activity that plays a critical role in the function of the SMN complex on
RNPs.

The data disclosed herein further demonstrate the production of
monoclonal antibodies to Gemin3. In addition, immunofluorescence microscopy
demonstrated that Gemin3 co-localizes with SMN in gems. Like SMN and SIP1,
Gemin3 can be isolated in a complex with several spliceosomal snRNP proteins.
Moreover, the data disclosed demonstrate that Gemin3 interacts directly with SMN and
with several of the spliceosomal snRNP core Sm proteins including the B and D2-3
proteins. The unique C-terminal domain of Gemin3 mediates interaction with SMN
and localization of Gemin3 to gems. The discovery of a DEAD box protein (i.e., a
likely RNA helicase) in the SMN complex is of particular interest as the functions
revealed so far suggest that this complex has crucial activities in the biogenesis of
RNPs. To perform such functions, including assembly of the snRNPs and the
regeneration of active components of the spliceosome, the SMN complex may effect
structural changes in its RNP targets. Of the known components of the SMN complex,
the DEAD box protein Gemin3 is the most likely protein to have the capacity to
perform such a function. Importantly, SMN proteins with mutations found in SMA
patients exhibited significantly reduced interaction with Gemin3, suggesting that the
SMN complexes in these patients are deficient in this protein.

The Materials and Methods used in the experiments presented in this 
example are now described. 

Identification of p105 protein by mass spectrometry

The p105 protein was co-immunoprecipitated with anti-SMN
monoclonal antibody 2B1 and the band was excised from a single one-dimensional
Coomassie stained polyacrylamide gel. The protein band was digested with trypsin
(unmodified trypsin, sequencing grade, Boehringer Mannheim, Indianapolis, IN) in-gel
as described in Shevchenko et al. (1996, Anal. Chem. 68:850-858). Tryptic peptides
were extracted and recovered from the gel using 5% formic acid and acetonitrile. The
extracts were pooled, dried in a speed vacuum, and the dried pellets
were redissolved in 5% formic acid. The peptides were analyzed by nanoelectrospray
tandem mass spectrometry as described in Wilm and Mann (1996, Anal. Chem. 66:1-8).
Nano ES MS/MS was performed on an API III triple quadrupole instrument (PE Sciex,
Ontario, Canada) equipped with a nanoelectrospray ion source developed at EMBL
(Wilm and Mann, 1996, Anal. Chem. 66:1-8).

Comprehensive protein and EST databases were searched using
PeptideSearch version 3.0 software. No limitations on protein molecular weight or
species of origin were imposed.
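
For illustration only, the tryptic digestion step that precedes such a database search can be modeled in silico. The sketch below is not the PeptideSearch software and does not reproduce its algorithm; it merely assumes the commonly cited trypsin rule (cleavage C-terminal to lysine or arginine, except when the next residue is proline). The function name and the toy sequence are hypothetical.

```python
def tryptic_digest(sequence, min_length=1):
    """Return the peptides expected from an idealized, complete trypsin digest.

    Assumed rule: cleave after K or R unless the following residue is P.
    Missed cleavages and modifications are not modeled.
    """
    seq = sequence.upper()
    peptides = []
    start = 0
    for i, residue in enumerate(seq):
        next_residue = seq[i + 1] if i + 1 < len(seq) else ""
        if residue in "KR" and next_residue != "P":
            peptides.append(seq[start:i + 1])
            start = i + 1
    # Keep the C-terminal peptide if the sequence does not end in K or R.
    if start < len(seq):
        peptides.append(seq[start:])
    return [p for p in peptides if len(p) >= min_length]


# Hypothetical toy sequence, not taken from Gemin3.
toy_peptides = tryptic_digest("MAKRPGKDEADR")
```

Peptides predicted in this way could then be compared against sequence tags derived from the MS/MS spectra; how that comparison is scored is outside the scope of this sketch.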

Production of proteins in vitro 

[35S]methionine labeled proteins were produced by an in vitro coupled
transcription-translation reaction (Promega Biotech, Madison, WI) in the presence of
[35S]methionine (Amersham, Arlington Heights, IL). Gemin3 and SMN fusion
proteins, each comprising a tag polypeptide consisting of a run of histidine amino acid
residues (6His) to produce 6His-Gemin3 and 6His-SMN fusion proteins, were expressed
from a pET bacterial expression system in the E. coli strain BL21(DE3), and the fusion
proteins were purified by nickel chelation chromatography using a kit (His-Bind
buffer kit; Novagen, Madison, WI) per the manufacturer's instructions.

Gemin3 fusion protein comprising a glutathione-S-transferase (GST) tag
polypeptide (i.e., GST-Gemin3) was expressed using a GST expression vector
(pGEX-5X-3; Pharmacia LKB Biotechnology Inc., Piscataway, NJ) in the E. coli strain BL21.
The GST-Gemin3 fusion protein was purified using glutathione-Sepharose (Pharmacia
LKB Biotechnology Inc., Piscataway, NJ) according to the manufacturer's protocol.

Production of monoclonal antibodies that specifically bind Gemin3 
Anti-Gemin3 antibodies, designated 11G9 and 12H12, were prepared by
immunizing BALB/c mice with the 6His-tagged C-terminal domain of Gemin3 (comprising
from about amino acid residue number 368 to about 548) purified by nickel chelation
chromatography using a Novagen (Madison, WI) His-Bind buffer kit per the
manufacturer's instructions. Hybridoma production, screening and ascites fluid
production were performed as described previously by Choi and Dreyfuss (1984, J.
Cell. Biol. 99:1997-2004).

Immunoprecipitation and immunoblotting

Immunoprecipitations of in vitro translated proteins were carried out in 
the presence of 1% Empigen BB buffer as previously described (Choi and Dreyfuss, 
1984, J. Cell Biol. 99:1997-2004). Immunoprecipitations of SMN, the Sm proteins 
and Gemin3 from cells were carried out using total HeLa cell lysate in the presence of 
1% Empigen BB buffer as previously described (Choi and Dreyfuss, 1984, supra). 
Immunoprecipitations and purifications of the SMN, Gemin2, Sm and Gemin3
complexes were carried out using total HeLa cell lysate in the presence of 0.5%
Triton X-100 as previously described in Pinol-Roma et al. (1988, Genes Dev.
2:215-227).

For immunoblotting, the HeLa cell lysate proteins were resolved using 
12.5% SDS-polyacrylamide gels and the proteins were then transferred to a 
nitrocellulose membrane (Schleicher and Schuell, Inc., Keene, NH) using a BioTrans
Model B Transblot apparatus (Gelman Science) according to the manufacturer's 
instructions. The membranes were then incubated in blocking solution (phosphate- 
buffered saline, PBS, containing 5% (w/v) nonfat milk) for at least 1 hour at room 
temperature. The membranes were rinsed with cold PBS, and then were incubated in 
blocking solution containing a selected primary antibody for at least 1 hour at room 
temperature. The membranes were subsequently washed three times in PBS containing 
0.05% (v/v) NP-40, and any bound antibodies were detected using peroxidase- 
conjugated goat anti-mouse IgG plus IgM (Jackson Immunoresearch Laboratories, 
West Grove, PA). The antibody-decorated protein bands were visualized using an 
enhanced chemiluminescence (ECL) Western blotting kit (Amersham, Arlington 
Heights, IL) after washing the membranes an additional three times with PBS 
containing 0.05% (v/v) NP-40. 

Cell culture and treatments 

HeLa cells were cultured in Dulbecco's modified Eagle's medium
(DMEM; GIBCO BRL, Gaithersburg, MD) supplemented with 10% (v/v) fetal bovine
serum (FBS; GIBCO BRL, Gaithersburg, MD).

Immunofluorescence microscopy 

Immunofluorescence staining was carried out essentially as described 
previously in Choi and Dreyfuss (1984, J. Cell. Biol. 99:1997-2004). Double-label 
immunofluorescence experiments were performed using separate, sequential
incubations with each primary antibody, each diluted in PBS containing 3% (w/v) bovine
serum albumin (BSA), followed by incubation using the specific secondary antibody
coupled to either fluorescein isothiocyanate (FITC) or Texas Red. All incubations in 
antibodies were carried out at room temperature for 1 hour. Laser confocal 
fluorescence microscopy was performed using a Leica Model TCS 4D confocal 
microscope (Leica, Inc., Exton, PA). Images from each channel were recorded 
separately and stored in separate data files. Subsequently, where indicated, the 
separate data files were merged. 

The antibodies used in these experiments were as follows: mouse IgG1
monoclonal anti-Gemin3 (11G9 and 12H12); mouse IgG1 monoclonal anti-SMN
(2B1); rabbit polyserum anti-p80 coilin (R288, Andrade et al., 1993, Proc. Natl. Acad.
Sci. USA 90:1947-1951); mouse IgG3 monoclonal anti-Sm (Y12, Lerner et al., 1981,
Proc. Natl. Acad. Sci. USA 78:2737-2741); and SP2/0, a mouse hybridoma that does not
secrete immunoglobulin chains (American Type Culture Collection [ATCC], Rockville,
MD). The rabbit affinity purified anti-Exon 7 antibody is described elsewhere herein.

In vitro protein-binding assay 

Purified GST or GST-fusion proteins (2 micrograms) bound to 25 microliters of
glutathione-Sepharose beads were incubated with 10^6 cpm of the in vitro translated
protein mixture in 1 ml of binding buffer (50 mM Tris-HCl [pH 7.5], 200 mM NaCl, 2
mM EDTA, 0.1% NP-40, 2 micrograms per milliliter leupeptin and pepstatin A, and
0.5% aprotinin). Following incubation for 1 hour at 4°C, the resin was washed five
times with 1 ml of binding buffer. The bound protein fraction was eluted by boiling
the beads in SDS-PAGE sample buffer, and the eluted proteins were run on
SDS-PAGE as described previously elsewhere herein. The gels were fixed for 30 minutes
and the radiolabeled signal was enhanced by treating the gels with Amplify solution
(Amersham, Arlington Heights, IL).

For direct in vitro binding, purified GST or GST-Gemin3 proteins (2
micrograms) bound to 25 microliters of glutathione-Sepharose beads were incubated
with 5 micrograms of purified 6His-tag-SMN or 6His-tag-SmB in 1 ml of binding
buffer (50 mM Tris-HCl [pH 7.5], 100 mM NaCl, 2 mM EDTA, 0.05% NP-40, 2
micrograms per milliliter leupeptin and pepstatin A, and 0.5% aprotinin). After
incubation for 1 hour at 4°C, the beads were washed five times with 1 ml of binding
buffer per wash. The bound protein fraction was eluted by boiling the beads in
SDS-PAGE sample buffer, and any 6His-SMN and/or 6His-SmB proteins eluted from the
beads were analyzed using SDS-PAGE and Western blot using a rabbit polyclonal
anti-His-tag antibody (Santa-Cruz Biotech., Santa Cruz, CA).
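
As a worked illustration of how the salt components of the binding buffers above might be made up from concentrated stocks, the following sketch applies the standard dilution relation C1 x V1 = C2 x V2. The stock concentrations (1 M Tris-HCl, 5 M NaCl, 0.5 M EDTA) and the 100 ml batch volume are assumptions chosen for the example and are not stated in this disclosure; only the final concentrations are taken from the text.

```python
def stock_volume_ml(final_conc_mM, stock_conc_mM, final_volume_ml):
    """Volume of stock needed, from C1 * V1 = C2 * V2."""
    return final_conc_mM * final_volume_ml / stock_conc_mM


# Hypothetical stock solutions (mM); not specified in the text.
STOCKS_MM = {"Tris-HCl": 1000.0, "NaCl": 5000.0, "EDTA": 500.0}

# Final concentrations (mM) of the buffer used with in vitro translated proteins.
BINDING_BUFFER_MM = {"Tris-HCl": 50.0, "NaCl": 200.0, "EDTA": 2.0}

BATCH_VOLUME_ML = 100.0  # assumed batch size

recipe_ml = {
    name: stock_volume_ml(conc, STOCKS_MM[name], BATCH_VOLUME_ML)
    for name, conc in BINDING_BUFFER_MM.items()
}
```

The detergent and protease inhibitors would be added separately, and the remaining volume made up with water; the direct-binding buffer differs only in its NaCl (100 mM) and NP-40 (0.05%) concentrations.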

Cell fractionation and chromatography 

HeLa cells were fractionated as described in Dignam et al. (1983, Nucl.
Acids Res. 11:1475-1489). S100 fractions (400 microliters comprising about 20
milligrams per milliliter protein) in buffer F (20 mM Tris-HCl [pH 7.4], 0.1 mM
EDTA, 1 mM DTT, 10% glycerol, 500 mM KCl) were loaded on a Superose 6 HR
10/30 column (Pharmacia LKB Biotechnology Inc., Piscataway, NJ). The column was
then washed with buffer A (20 mM Tris-HCl [pH 7.4], 200 mM NaCl, 2.5% glycerol).
Selected protein fractions (0.5 ml) were collected, and 30 microliters of each fraction was
resolved on SDS-PAGE followed by Western blotting.
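
The quantities given for the gel filtration step imply a simple piece of arithmetic, sketched below: the total protein loaded on the column and the share of each collected fraction that is analyzed by SDS-PAGE. The function names are hypothetical, and the aliquot volume is taken to be 30 microliters.

```python
def protein_loaded_mg(volume_ul, conc_mg_per_ml):
    """Total protein in a sample, converting microliters to milliliters."""
    return volume_ul / 1000.0 * conc_mg_per_ml


def share_of_fraction(aliquot_ul, fraction_ml):
    """Portion of a collected fraction that is loaded on the gel."""
    return aliquot_ul / (fraction_ml * 1000.0)


loaded_mg = protein_loaded_mg(400, 20)     # 400 microliters at about 20 mg/ml
gel_share = share_of_fraction(30, 0.5)     # 30 microliters of a 0.5 ml fraction
```

Under these figures, roughly 8 mg of S100 protein is loaded per run, and about 6% of each fraction is resolved on the gel.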
The Results of the experiments presented in this example are now
described.

Gemin3, a novel SMN-interacting protein with a DEAD box RNA
helicase domain

Immunoprecipitations from [35S]methionine labeled HeLa cell lysates
with anti-SMN and anti-SIP1 monoclonal antibodies disclosed the presence of several
protein components in the SMN-SIP1 complex (Liu et al., 1997, Cell 90:1013-1021).
Among the proteins that can be co-immunopurified with anti-SMN and anti-SIP1
antibodies, only some of the major low molecular mass proteins, identified as the Sm
proteins, have so far been characterized as disclosed elsewhere herein. In addition to
SMN, SIP1, and the Sm proteins, there is a doublet at about 97 kDa and additional
bands at 175 kDa, 95 kDa, 60 kDa and 50 kDa that co-immunopurified with the
anti-SMN antibody. The two proteins of the 97 kDa doublet were eluted from the gel,
digested with trypsin, and the resulting peptides were sequenced using
nanoelectrospray mass spectrometry as described previously (Wilm et al., 1996, Anal.
Chem. 66:1-8; Shevchenko et al., 1996, Anal. Chem. 68:850-858). The data
disclosed herein demonstrate the molecular cloning and characterization of the higher
molecular weight protein of this doublet (i.e., p105). Several peptides from this band
identified a human EST sequence (clone #AA303940) using the peptide sequence tag
algorithm (Figure 2 A and 2B). Several additional cDNA clones were obtained by
hybridization screening of a human leukemia 5'-STRETCH PLUS cDNA library using
this EST clone as a probe. Twelve independent partial cDNA clones with insert sizes
ranging from about 1 to 2.5 kb, all of which contained overlapping regions of the same
open reading frame (ORF), were isolated. 5' RACE PCR was used to extend this
cDNA further upstream. A cDNA clone containing the longest ORF was constructed
and conceptual translation of its nucleotide sequence disclosed a potential initiator
methionine preceded by an in-frame stop codon. This cDNA encodes a putative
protein of 824 amino acids with a calculated molecular mass of 92.2 kDa and a pI of
6.5. Next, it was determined that this cDNA encodes the p105 protein that
co-immunoprecipitates with SMN. Thus, this is a full-length cDNA clone (SEQ ID NO:1)
(GenBank accession number AF171063) for a novel component of the SMN complex
designated Gemin3 for component of gems number 3 (see below). Because of the
existence of several unrelated proteins called SIP1 (Mylin et al., 1994, Genetics
137:689-700; Zhang et al., 1998, Mol. Cell. Biol. 18:676-684; Verschueren et al., 1999,
J. Biol. Chem. 274:2089-2098), the SMN-interacting protein SIP1 has been tentatively
renamed Gemin2, for component of gems number 2 (SMN is the first component of gems
identified, Liu and Dreyfuss, 1996, EMBO J. 15:3555-3565). Gemin3 has high amino
acid sequence similarity with the RNA-helicase core region of the human eukaryotic
initiation factor 4A-II (eIF4A-II). eIF4A-II is a DEAD-box RNA helicase that belongs
to the SFII superfamily of helicases (reviewed in De la Cruz et al., 1999, TIBS
24:192-198). A scheme depicting the modular structure of Gemin3 and the predicted
amino acid sequence of Gemin3 aligned with the sequence of eIF4A-II is presented in
Figure 26. This alignment disclosed the presence of seven motifs in the Gemin3
protein, motifs that are characteristic of the RNA helicase core region. Database
searches with the C-terminal nonconserved region did not reveal significant homology
to any other protein or to any recognizable motifs.
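
Two of the sequence-level observations above lend themselves to a quick sanity check, sketched here under stated assumptions. The first function estimates molecular mass from residue count using a rough average residue mass of 110 Da, which is a widely used rule of thumb and not the method used to obtain the 92.2 kDa value reported above. The second locates occurrences of the four-residue DEAD motif in a sequence; the toy sequence is hypothetical and is not the Gemin3 sequence of SEQ ID NO:1.

```python
AVERAGE_RESIDUE_MASS_DA = 110.0  # assumption: rough average for amino acid residues


def approx_mass_kda(n_residues, avg_residue_da=AVERAGE_RESIDUE_MASS_DA):
    """Rough molecular mass estimate in kDa from the number of residues."""
    return n_residues * avg_residue_da / 1000.0


def find_motif(sequence, motif="DEAD"):
    """Return 1-based start positions of every occurrence of the motif."""
    seq = sequence.upper()
    positions = []
    start = seq.find(motif)
    while start != -1:
        positions.append(start + 1)
        start = seq.find(motif, start + 1)
    return positions


estimate_kda = approx_mass_kda(824)              # 824 residues, per the text
toy_hits = find_motif("MKTDEADRLLDEADG")         # hypothetical toy sequence
```

The rough estimate (about 90.6 kDa) is close to the calculated 92.2 kDa, as expected for a rule of thumb. A plain string search finds only the literal DEAD tetrapeptide; identifying all seven helicase core motifs would require an alignment such as the one presented in Figure 26.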

Production of monoclonal antibodies to Gemin3 

To investigate the interaction of Gemin3 with SMN and to characterize
Gemin3 further, monoclonal antibodies that specifically bind Gemin3 were produced
by immunizing mice with a purified, bacterially produced recombinant 6His-tagged
Gemin3 fragment (amino acids 368 to 548). Two hybridomas, 11G9 and 12H12, were
selected for additional studies. The data disclosed herein demonstrate that these
hybridomas produce monoclonal antibodies that recognize Gemin3 specifically. First,
both 11G9 and 12H12 immunoprecipitate Gemin3 produced by in vitro transcription
and translation from the Gemin3 cDNA, but do not immunoprecipitate similarly
produced hnRNP A1 or SMN proteins (Figure 28A). Second, the monoclonal antibody
11G9 efficiently recognized purified 6His-Gemin3 on Western blots but did not
recognize similarly produced and purified 6His-tagged Gemin2 (Figure
28B). Finally, on an immunoblot of total HeLa lysate, both 11G9 and 12H12 recognize
a single protein of approximately 105 kDa (Figure 28C). Monoclonal antibodies 11G9
and 12H12 did not recognize a specific protein on a Western blot with total mouse 3T3
cell lysate or Xenopus laevis XL-177 cell lysate. However, 11G9 specifically
immunoprecipitated a single protein of about 105 kDa from these cell lysates,
suggesting that Gemin3, like SMN, is conserved in vertebrates.
Gemin3 and SMN co-localize in gems 
Indirect laser confocal immunofluorescence microscopy using
antibodies 11G9 and 12H12 was performed on HeLa cells to determine the subcellular
localization of Gemin3. Figure 29A shows that Gemin3 is found throughout the
cytoplasm and also displays intense staining of prominent discrete nuclear bodies that
are also readily discernible by differential interference contrast (DIC) microscopy
(Figure 29B). This pattern is similar to that seen for SMN and Gemin2 (Liu and
Dreyfuss, 1996, EMBO J. 15:3555-3565; Example 1, supra), except that the
nucleoplasmic staining of Gemin3 was stronger than that detected for SMN and
Gemin2 (SIP1).

In order to assess whether the nuclear structures stained by 11G9 were
gems or coiled bodies, double-label immunofluorescence experiments were performed
using antibodies against Gemin3 and against either p80-coilin as a marker of coiled bodies
(Andrade et al., 1991, J. Exp. Med. 173:1407-1419) or SMN as a marker of gems
(Liu and Dreyfuss, 1996, EMBO J. 15:3555-3565; Figure 29A). In many cell lines
gems and coiled bodies entirely overlap by antibody staining; however, in the HeLa PV
strain used herein, these two bodies are frequently found separate from each other (Liu
and Dreyfuss, 1996, EMBO J. 15:3555-3565; Matera and Frey, 1998, Am. J. Hum.
Genet. 63:317-321). Therefore, HeLa PV cells were used to examine whether Gemin3
is located in gems or in coiled bodies. As depicted in Figure 29C-29D, the nuclear
structures that contain Gemin3 were clearly distinct from coiled bodies, but Gemin3
completely co-localized with SMN in gems (Figure 29E). The co-localization of
Gemin3 with SMN strongly supports the conclusion that these two proteins exist as a
complex in the cell. Gemin3 is, thus, the third constituent of gems described so far.

Gemin3 is in a complex with SMN, Gemin2 and the spliceosomal Sm
proteins

To characterize further the Gemin3 complex, immunoprecipitations
using anti-Gemin3 monoclonal antibodies and [35S]methionine labeled HeLa cells were
carried out in the presence of either Triton X-100 or the more stringent detergent
Empigen BB (Matunis et al., 1994, Methods Cell Biol. 44:191-205). The
immunoprecipitated proteins were then analyzed by SDS-PAGE. As references for
these immunoprecipitations, an immunoprecipitation with the anti-Sm monoclonal
antibody Y12 (Lerner and Steitz, 1979, Proc. Natl. Acad. Sci. USA 76:5495-5499;
Lerner et al., 1981, Proc. Natl. Acad. Sci. USA 78:2737-2741) and an
immunoprecipitation with the anti-SMN monoclonal antibody 2B1 were also included
for purposes of comparison. As shown in Figure 30A, several proteins can be
co-immunoprecipitated with Gemin3 and the pattern of immunoprecipitated proteins is
very similar to that obtained with the anti-SMN antibody. In addition to Gemin3, SMN
and Gemin2, there are several prominent bands at 175 kDa, 95 kDa and 50 kDa. The
two groups of proteins at 28 kDa and 15 kDa have been identified previously as the Sm
B/B', D1-3, E, F and G proteins of snRNPs (e.g., Example 1). In addition, there were
protein bands which co-immunoprecipitated only with anti-SMN (at 60 kDa) or only
with anti-Gemin3 (at 115 kDa) monoclonal antibodies. As further evidence for the
specificity of the antibodies used, the immunoprecipitations were performed in the
presence of Empigen BB. Under these conditions, anti-Gemin3 and anti-SMN
antibodies immunoprecipitate Gemin3 and SMN proteins, respectively (+ Empigen BB,
lane 11G9 and lane 2B1, respectively). Even in the presence of Empigen BB, a protein
of 95 kDa was still present in both of these
immunoprecipitations, but not in the control SP2/0 immunoprecipitation, suggesting
that this unidentified protein interacts tightly with both Gemin3 and SMN.

To confirm the co-immunopurification results, the interaction of
Gemin3 with SMN, Gemin2 and the Sm proteins in HeLa cells was assessed in vivo
using immunoprecipitations and Western blot experiments. The anti-Gemin3
monoclonal antibody 11G9 was used for immunoprecipitation from total HeLa cell
extracts, and the immunoprecipitates were then resolved by SDS-PAGE, transferred to
nitrocellulose, and the immunoblot was probed with the anti-SMN antibody 2B1 (Liu and
Dreyfuss, 1996, EMBO J. 15:3555-3565). As depicted in Figure 30C (lane 11G9 IP),
2B1 readily detects SMN in the 11G9 immunoprecipitates, indicating that SMN is
co-immunoprecipitated with Gemin3. Because SMN is associated with Gemin2 to form a
stable complex in vivo and in vitro (see, e.g., Example 1), whether Gemin3 could be
co-immunoprecipitated with Gemin2 was determined. As disclosed in Figure 30C, the
anti-Gemin2 monoclonal antibody 2S7 clearly detects Gemin2 in the anti-Gemin3
11G9 immunoprecipitates (lane 11G9 IP). In a reciprocal experiment, the Gemin3
protein could also be co-immunoprecipitated by the anti-SMN monoclonal antibody
2B1 (Figure 30D, lane 2B1 IP) and the anti-Gemin2 monoclonal antibody 2S7 (Figure
30D, lane 2S7 IP). Because SMN and Gemin2 are found in a complex with the Sm
proteins, it was determined whether Gemin3 can be co-immunoprecipitated with the
spliceosomal snRNP Sm core proteins as well. Figure 30D demonstrates that Gemin3
is present in the anti-Sm monoclonal antibody Y12 immunoprecipitates (lane IP Y12)
like SMN and Gemin2 (Liu et al., 1997, Cell 90:1013-1021). No Gemin3, SMN,
Gemin2 or Sm proteins were detected in a SP2/0 immunoprecipitate. These results
demonstrate that Gemin3, SMN, Gemin2, and the Sm proteins are associated in vivo in
a complex that can be immunoprecipitated by either anti-SMN, anti-Gemin2, anti-Sm
or anti-Gemin3 antibodies.

Further support for the existence in vivo of a complex that contains
SMN, Gemin2 and Gemin3 was obtained from gel filtration experiments. HeLa
cytoplasmic S100 extract was fractionated on a Superose 6 HR 10/30 high performance
gel filtration column and each fraction was subjected to SDS-PAGE followed by
Western blot with anti-Gemin3, anti-SMN and anti-Gemin2 monoclonal antibodies.
Gemin3, SMN and Gemin2 co-migrated and showed a peak at about 800 kDa,
demonstrating that they are components of a large macromolecular complex (Figure
30E). A second pool of SMN-Gemin2, lacking Gemin3, was observed in a lower
molecular weight complex which peaked at about 150 kDa, suggesting that at least two
different SMN-Gemin2 sub-complexes exist in vivo. However, without wishing to be
bound by any particular theory, the possibility cannot be excluded that the 150 kDa
sub-complex corresponded to a fraction of SMN-Gemin2 that dissociated from Gemin3
during cell fractionation and/or chromatography. The data disclosed previously
elsewhere herein demonstrate that an SMN-Gemin2 complex migrates at about 300 kDa,
or higher, after filtration of a cytoplasmic S100 extract on a TSK-GEL G3000-SW
column (Example 1). The Superose 6 HR 10/30 gel filtration column used herein
permitted better resolution of the cytoplasmic SMN complex and a more
accurate estimate of its size, which is about 800 kDa.
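
By way of non-limiting illustration, the apparent size of a complex on a gel
filtration column is commonly estimated from a calibration line relating the
logarithm of the molecular weight of size standards to their elution volume. The
following sketch shows one way such an estimate could be computed. The function
names and the calibration values are hypothetical and are not taken from the
experiments disclosed herein.

```python
import math

def fit_calibration(standards):
    """Least-squares fit of log10(MW) = intercept + slope * elution_volume.

    `standards` is a list of (elution_volume_mL, molecular_weight_kDa) pairs.
    """
    volumes = [v for v, _ in standards]
    log_mw = [math.log10(mw) for _, mw in standards]
    n = len(standards)
    mean_v = sum(volumes) / n
    mean_y = sum(log_mw) / n
    sxx = sum((v - mean_v) ** 2 for v in volumes)
    sxy = sum((v - mean_v) * (y - mean_y) for v, y in zip(volumes, log_mw))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_v
    return intercept, slope

def estimate_mw(elution_volume, intercept, slope):
    """Apparent molecular weight (kDa) of a peak eluting at `elution_volume`."""
    return 10 ** (intercept + slope * elution_volume)

# Hypothetical calibration points (mL, kDa); real values depend on the column,
# the buffer and the standards actually run.
HYPOTHETICAL_STANDARDS = [(12.0, 669.0), (14.0, 440.0), (16.0, 158.0), (18.0, 44.0)]

if __name__ == "__main__":
    intercept, slope = fit_calibration(HYPOTHETICAL_STANDARDS)
    # Peak fraction volume is likewise a placeholder.
    print(f"apparent size: {estimate_mw(13.0, intercept, slope):.0f} kDa")
```

Because larger complexes elute earlier, the fitted slope is negative. An estimate
of this kind assumes a roughly globular shape, so it gives an apparent size
rather than an exact mass.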

Gemin3 interacts directly with SMN and the spliceosomal Sm proteins in vitro

To further analyze the Gemin3 complex, in vitro protein binding assays
between Gemin3 and several components of the SMN complex were performed. For
in vitro binding assays, Gemin3 was produced as a fusion protein with glutathione
S-transferase (GST), and SMN and Gemin2 were produced and labeled with
[35S]methionine by in vitro transcription and translation in rabbit reticulocyte lysate.
Purified GST or GST-Gemin3 fusion protein immobilized on glutathione-Sepharose was
incubated with labeled SMN or Gemin2 proteins. Following extensive washing, bound
proteins were eluted by boiling in SDS-containing sample buffer and the eluted
material was analyzed by SDS-PAGE and detected by fluorography. Full-length SMN,
but not Gemin2, bound specifically to immobilized GST-Gemin3 (Figure 31A) but not
to GST alone. To investigate whether Gemin3 interacts with Sm proteins, purified
GST or GST-Gemin3 recombinant proteins were used for binding assays with in vitro
[35S]methionine-labeled Sm proteins B, D1, D2, D3, E, F and G (Lehmeier et al., 1994,
Proc. Natl. Acad. Sci. USA 91:12317-12321; Herrmann et al., 1995, EMBO J.
14:2076-2088; Raker et al., 1996, EMBO J. 15:2256-2269). The results, depicted in
Figure 31B, demonstrate that the Sm proteins B and D3 bind to GST-Gemin3, whereas
there was no detectable binding to GST alone. D2 binds Gemin3 only weakly, and the
profiles of Sm protein binding to SMN and Gemin3 are not identical (see Example 1).
For example, SMN binds to D1 while Gemin3 does not.

To address the possibility that some component of the rabbit
reticulocyte lysate mediates these interactions, wild-type full-length SMN and SmB
were produced as recombinant 6His-tagged proteins and were incubated with GST or
GST-Gemin3. After several rounds of washing, bound proteins were solubilized by
boiling in SDS-sample buffer, resolved by SDS-PAGE, immunoblotted and probed
with a rabbit polyclonal antibody specific to the 6His-tag. As depicted in Figure 31C,
SMN and SmB bind specifically to Gemin3 but not to GST alone. Thus, both SMN
and SmB interact directly with Gemin3.

In order to further characterize the interaction between Gemin3 and
SMN, it was determined whether SMN carrying either of two well-characterized mutations
found in SMA patients, the Y272C point mutant (SMNY272C) and the exon 7 deletion
mutant (SMNΔEx7), the major product of the SMN2 gene (reviewed in Burghes,
1997, Am. J. Hum. Genet. 61:9-15; Talbot et al., 1997, Hum. Mol. Genet. 6:497-500),
was able to interact with Gemin3. Wild-type and mutant SMN were produced and
labeled with [35S]methionine using in vitro transcription and translation in rabbit
reticulocyte lysate. Full-length wild-type SMN bound specifically to immobilized
GST-Gemin3 (Figure 31D). However, SMNY272C and SMNΔEx7 are severely
defective in their ability to bind GST-Gemin3. No detectable binding to GST alone
was observed. Similar results were observed using purified recombinant 6His-SMN
wild-type and mutant proteins instead of in vitro translated products.

SMN oligomerization and Sm binding are not mutually exclusive, and
in fact, Sm binding is strongly enhanced by SMN oligomerization (e.g., Figure 31E).
To determine whether SMN self-association enhances Gemin3 interaction, GST-SMN,
or GST as a control, was pre-incubated with a molar excess of recombinant 6His-tagged
SMN to form SMN oligomers. After removing the unbound 6His-tagged SMN by
washing, in vitro translated [35S]methionine-labeled Gemin3 and SmB were added and
assayed for binding (Figure 31E). SmB binding was strongly enhanced by SMN
oligomerization; however, Gemin3 binding was not affected.

Gemin3 interacts with SMN via its unique C-terminal domain 
The unwinding activity of DEAD box RNA helicases may not be
sequence specific. The target specificity of these proteins is, at least in some cases,
provided by their interaction with specific proteins of the RNP substrate. These
interactions appear to be mediated via the unique auxiliary domain that each RNA
helicase contains (Staley and Guthrie, 1998, Cell 92:315-326; Hamm and Lamond,
1998, Curr. Biol. 8:532-534). Therefore, the role of the unique C-terminal domain of
Gemin3 (amino acids 430-825) in the interaction with SMN was determined. To do so,
three deletion mutants of Gemin3 were constructed and tested for their ability to bind
GST-SMN. Wild-type and mutant myc-Gemin3 constructs were transcribed and
translated in rabbit reticulocyte lysate in the presence of [35S]methionine, and the
resultant translated products were assayed for binding to GST-SMN as described
previously elsewhere herein. As Figure 32B indicates, the wild-type myc-Gemin3
protein and the myc-ΔN368C277Gemin3 mutant protein interact specifically with
GST-SMN but not with GST alone. The myc-ΔC328Gemin3 and myc-ΔN548Gemin3
mutant proteins clearly do not interact with GST-SMN. Thus, the C-terminal domain
of Gemin3 (amino acids 456 to 547) mediates the interaction of SMN with Gemin3.

The molecular characterization of the spinal muscular atrophy gene
product, SMN, demonstrated that it is concentrated in novel nuclear structures called
gems (Liu and Dreyfuss, 1996, EMBO J. 15:3555-3565; Example 1). Coiled bodies
and gems represent nuclear structures that appear to be involved in RNA metabolism,
and in many of the cell lines studied these two bodies are often found in association
(Lamond and Carmo-Fonseca, 1993, Trends Cell Biol. 3:198-204; Gall et al., 1995,
Dev. Genet. 16:25-35; Liu et al., 1997, Cell 90:1013-1021; Liu and Dreyfuss, 1996,
EMBO J. 15:3555-3565; Example 4; Matera and Frey, 1998, Am. J. Hum. Genet.
63:317-321). SMN is also found in the cytoplasm where, together with its tightly
associated partner, Gemin2, it functions in the assembly of snRNP particles (Example
2; Example 4). In the nucleus, SMN is required for pre-mRNA splicing, and likely
serves to assemble and maintain the splicing machinery in an active form, as discussed
previously elsewhere herein. To perform these functions, SMN must either have an
intrinsic activity or recruit to the complex other proteins that can actively affect
structural transitions in certain RNP targets. Several factors that have the capacity to
serve in such functions, including assembly and disassembly of components of the
splicing machinery, have been described. Many of these factors are DEAD/DEAH box
RNA helicases that are essential for splicing (reviewed in Staley and Guthrie, 1998).
Prp43, for instance, is required for the disassembly of the snRNP-intron lariat complex
(Arenas and Abelson, 1997, Proc. Natl. Acad. Sci. USA 94:11798-11802), Prp22 is
needed to release the mature mRNA from the spliceosome (Company et al., 1991,
Nature 349:487-493), and Prp24 acts as a recycling factor for U4 and U6 snRNP
(Raghunathan et al., 1998, Science 279:857-860).

Using a biochemical approach to characterize new components of the
SMN complex, a novel DEAD box RNA helicase termed Gemin3 has been identified.
Gemin3 forms a stable complex with SMN in vivo and in vitro and it co-localizes with
SMN in nuclear gems. Several lines of evidence suggest that Gemin3 and SMN
function as a complex in vivo. SMN and Gemin3 can be co-immunoprecipitated and
both are present in a large (approximately 800 kDa) complex that also contains
Gemin2. Anti-SMN, anti-Gemin2 or anti-Gemin3 monoclonal antibodies
immunoprecipitate the spliceosomal snRNP core Sm proteins, as well as several other
unidentified proteins. Gemin3 interacts directly with SMN and with several snRNP
Sm core proteins, including B/B', D2 and D3. In addition, Gemin3 is uniformly
distributed in the cytoplasm, where snRNP assembly takes place, and it can be
specifically co-immunoprecipitated with the cytoplasmic pool of Sm proteins.
Together, these findings suggest that Gemin3 plays an important role in spliceosomal
snRNP biogenesis.

DEAD box proteins have been found to be involved in many aspects of
RNA metabolism, including pre-mRNA splicing, translation, snRNP-snRNP
interactions, mRNA degradation, and mRNA transport in eukaryotes and prokaryotes
(Arenas and Abelson, 1997, Proc. Natl. Acad. Sci. USA 94:11798-11802; Company et
al., 1991, Nature 349:487-493; Ohno and Shimura, 1996, Genes & Dev. 10:997-1007;
Staley and Guthrie, 1998, Cell 92:315-326; Hamm and Lamond, 1998, Curr. Biol.
8:532-534; De la Cruz et al., 1999, TIBS 24:192-198). One of the major questions
about the function of each DEAD/DEAH box RNA helicase is the identity of its
specific RNA target. Some of the enzymes of this family can unwind generic
RNA substrates in vitro. For these enzymes, the specificity towards particular RNAs
therefore appears to be determined by factors that interact with their unique auxiliary
domains. For example, the DEAH-box RNA helicase Prp16 is recruited to the
spliceosome via its unique N-terminal domain. The specific substrate for Gemin3 has
not yet been identified and this remains a central question of interest. Although RNA
helicase or RNA-dependent ATPase activity has not been detected for recombinant
Gemin3, it is possible that such activity will only manifest itself when Gemin3 is
associated with other proteins as part of a complex, or that it will be detectable once a
specific RNA or RNP target is found. The interaction of Gemin3 with SMN is direct;
amino acids 456 to 547 of Gemin3 mediate this interaction and, likely as a
consequence of this, also mediate the localization of Gemin3 to the gems. Thus,
without wishing to be bound by theory, Gemin3 provides the enzymatic activity of the
SMN complex to affect structural transitions in its RNA targets.

The SMN protein is capable of forming an oligomer of more than 400
kDa in vitro and the data disclosed herein demonstrate that SMN co-migrates with an
approximately 800 kDa complex that also contains Gemin2 and Gemin3. It is likely
that SMN oligomerization is critical for the nucleation of this large complex. In
addition to Gemin3 and Gemin2, several Sm proteins interact with SMN, and it may be
that SMN forms a docking platform to bring together in the appropriate spatial
arrangement the multiple proteins that are involved in the de novo assembly and
regeneration of its RNP (e.g., snRNP) substrates. Interestingly, the interaction of SMN
with Gemin3 is severely reduced by mutations found in SMA patients, such as the
point mutant SMNY272C or the exon 7 deletion. Thus, the formation of the SMN
platform seems critical for SMN function because mutations found in SMA patients
affect both the capacity of SMN to oligomerize and its capacity to interact with
several Sm proteins and Gemin3. Likely as a consequence of these defective
interactions, the function of SMN in the regeneration of the splicing machinery is
abolished.

Coiled bodies contain the highest local concentration of p80 coilin and
are enriched in components of three major RNA processing pathways: pre-mRNA
splicing, histone mRNA 3' maturation and pre-rRNA processing. Gems contain the
highest local concentration of SMN, Gemin2 and Gemin3 and are often found
associated with coiled bodies (see, e.g., Liu and Dreyfuss, 1996, EMBO J.
15:3555-3565; Example 1). Although the definitive function of these two nuclear
bodies has not been completely elucidated, the characterization of their protein and
RNA contents represents an important step toward the understanding of their
functions. Further studies of Gemin3, a novel DEAD box containing protein and
component of gems, will shed light on the functions of the SMN complex and gems.

Example 6: Production of a cell line comprising a reduced level of SMN for the
study of SMA

To facilitate the study of the function of SMN in motor neurons and to
develop a system that may be useful for testing potential therapeutic approaches, a cell
line can be generated that expresses drastically reduced levels of SMN. The cell line
can be produced by transfecting a neuronal cell line which normally expresses SMN
with an SMN-modulating sequence, e.g., an antisense nucleic acid complementary to a
nucleic acid encoding SMN or an SMN-specific ribozyme. A stable cell line can be
obtained having significantly reduced SMN protein levels, as demonstrated by
decreased SMN mRNA expression.
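
By way of non-limiting illustration, an antisense sequence is the reverse
complement of the sense (coding) strand it is intended to pair with. The sketch
below shows how such a sequence could be derived computationally. The function
name and the short example sequence are hypothetical and do not correspond to
any SMN sequence disclosed herein.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, read 5' to 3'."""
    invalid = set(seq) - set("ACGTacgt")
    if invalid:
        raise ValueError(f"unexpected characters in sequence: {sorted(invalid)}")
    # Complement every base, then reverse so the result also reads 5' to 3'.
    return seq.translate(COMPLEMENT)[::-1]

if __name__ == "__main__":
    # Hypothetical 12-nucleotide sense fragment, for illustration only.
    sense = "ATGGCGATGAGC"
    print(reverse_complement(sense))
```

Applying the function twice returns the original sequence, which provides a
simple check that an insert has been oriented as intended.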

Generation of cell lines

Motor neuron cell lines have been generated (Salazar Grueso et al.,
Neuroreport 2:505-508) from embryonic murine spinal cord cells enriched for motor
neurons by differential centrifugation, based on the approach originally described by
Hammond et al. (Science 234:1237-1240). Human SMN cDNA clone BCD541 is
available in the art. SMN cDNA constructs, in either sense or antisense orientation,
may be subcloned into a mammalian expression vector, for example, pZeoSV2
(InVitrogen, Carlsbad, CA), using standard methods (see, e.g., Sambrook et al., supra,
and Ausubel et al., supra). After transfection of the motor neuron cell line with sense,
antisense, or vector-only constructs, stable integrants may be isolated and the resultant
transgenic cell lines can be referred to as "sense", "antisense" or "vector-only" cells,
respectively. The cell lines are cultured as described in Salazar Grueso et al., supra.

DNA Isolation and PCR Analysis 

The cells are grown in culture and DNA is isolated therefrom using
standard methods. PCR assays may be performed using primers and probes specific
for human SMN cDNA as described in, for example, Sambrook et al., supra, and
Ausubel et al., supra.

RNA Isolation and RT-PCR 

Total RNA may be extracted from cells using the RNeasy kit (Qiagen,
Chatsworth, CA) according to the manufacturer's instructions. Total RNA may be
reverse-transcribed using, for example, the SuperScript II reverse transcriptase
(GIBCO/BRL) and random hexamers per the manufacturer's instructions. PCR may be
performed as described elsewhere herein using human SMN-specific primers. The
PCR products may be separated on an agarose gel and may then be visualized by
staining with ethidium bromide.

Western blot analysis 

Total cell lysates in the presence of 0.5% Triton X-100 are prepared as
previously described by Coovert et al. (1997, Hum. Mol. Genet. 6:1205-1214). Equal
amounts of protein are loaded in each lane of an SDS-PAGE gel and the Western blot
is performed as previously described herein. The following monoclonal antibodies can
be used: anti-SMN (2B1), anti-SIP1 (2E17), anti-hnRNP A1 (4B10) (Pinol-Roma et
al., 1988, Genes & Dev. 2:215-227), and anti-Sm (Y12) (Lerner et al., 1981, Proc.
Natl. Acad. Sci. USA 78:2737-2741).

Immunofluorescence analysis 

Immunofluorescence staining is carried out essentially as previously
described (Choi et al., 1984, J. Cell. Biol. 99:1997-2004). Double-label
immunofluorescence experiments are performed by separate sequential incubations of
each primary antibody, at an appropriate dilution, followed by incubation with the
specific secondary antibody coupled to fluorescein isothiocyanate (FITC) or Texas Red
(TXRD). Laser confocal fluorescence microscopy is performed and the images from
each channel are recorded separately, and then the data files are merged. The
following antibodies may be used: rabbit polyserum anti-p80 coilin (R288) (Andrade et
al., 1991, J. Exp. Med. 173:1407-1419), mouse IgG1 monoclonal anti-SMN (2B1), and
mouse IgG1 monoclonal anti-SIP1 (2E17).

Cell growth study

Cells from each of the three cell lines, sense, antisense and vector-only,
may be synchronized by serum deprivation and the cells may be plated in triplicate for
growth analyses.
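
By way of non-limiting illustration, triplicate cell counts of this kind could be
summarized as a mean per time point and a population doubling time, assuming
exponential growth between two counts. The sketch below uses hypothetical
function names and hypothetical counts; it does not report results obtained with
the cell lines described herein.

```python
import math
from statistics import mean

def doubling_time(n_start, n_end, hours):
    """Doubling time (hours), assuming exponential growth between two counts."""
    if n_start <= 0 or n_end <= n_start or hours <= 0:
        raise ValueError("need n_end > n_start > 0 and hours > 0")
    return hours * math.log(2) / math.log(n_end / n_start)

def summarize(triplicates_by_hour):
    """Average the replicate counts at each time point."""
    return {hour: mean(counts) for hour, counts in sorted(triplicates_by_hour.items())}

if __name__ == "__main__":
    # Hypothetical counts (cells per well), three replicate wells per time point.
    counts = {
        "sense":       {0: [1.0e5, 1.1e5, 0.9e5], 48: [4.1e5, 3.9e5, 4.0e5]},
        "antisense":   {0: [1.0e5, 1.0e5, 1.0e5], 48: [2.1e5, 1.9e5, 2.0e5]},
        "vector-only": {0: [1.0e5, 0.9e5, 1.1e5], 48: [4.0e5, 4.2e5, 3.8e5]},
    }
    for line, data in counts.items():
        means = summarize(data)
        td = doubling_time(means[0], means[48], 48)
        print(f"{line}: doubling time about {td:.1f} h")
```

Comparing the doubling time of the antisense line against the sense and
vector-only lines is one simple way to express a growth difference as a single
number per cell line.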

Transfection of a neuronal cell line with SMN cDNA constructs 

Human SMN cDNA constructs, in either sense or antisense orientation,
are subcloned into a mammalian expression vector, pZeoSV2, to allow constitutive
expression of the insert driven by the SV40 early enhancer/promoter. A spinal cord
neuronal cell line exhibiting the characteristics of primary motor neurons, including
high levels of choline acetyltransferase (ChAT), extensive branching neurite formation,
and glial-derived neurotrophic factor (GDNF) receptors, as described by Salazar Grueso
et al. (Neuroreport 2:505-508; Trupp et al., Nature 381:785-788), may be used for
transfections.

Motor neuron cells are transfected with sense, antisense, or vector-only
constructs, and the cells are grown in selective medium (e.g., medium containing
Zeocin in the case of pZeoSV2 constructs). Stable transfectants are isolated, and
these are referred to as sense, antisense, or vector-only cells, respectively. The
introduction of a human cDNA encoding SMN into a cell line permits confirmation
of the presence or absence of the human SMN sequence and facilitates the
monitoring of human SMN expression.

Detection of the presence and expression of human SMN in transfectants

The presence of the human SMN transgene DNA in both the sense and
antisense cells, but not in vector-only cells, may be confirmed by PCR of DNA
obtained from stable transfectants using human-specific primers. Further, expression
of the human SMN sense or antisense cDNA constructs may be demonstrated by
RT-PCR of total RNA using human-specific primers. Using these primers, SMN RNA
may also be detected in positive control human tissue.

Detection of altered SMN protein level in transfectants

Immunoblotting using the anti-SMN monoclonal antibody, 2B1, should
identify whether the cells exhibit reduced levels of expression of SMN.

Example 7: Cell line lacking endogenous SMN expression 

To facilitate the study of the function of SMN in cells and to develop a
system for studying the role(s) of SMN in cell processes as well as for testing potential
therapeutic approaches for SMA, a cell line was generated that lacked endogenous
expression of SMN but was stably transfected with a plasmid vector comprising a
nucleic acid encoding SMN under the control of an inducible promoter. More
specifically, a chicken pre-B lymphoid cell line, i.e., DT40, which exhibited a high
degree (i.e., approximately 1,000-fold higher than normal) of homologous
recombination, was used to generate a cell line lacking a nucleic acid encoding chicken
SMN. In this cell line, endogenous cSMN expression was repressed by
expression of non-functional HA-cSMN under the control of an inducible
promoter (i.e., the tetracycline-repressible promoter) (see, e.g., Figures 33A, 33B, 34A,
and 34B). The system allows the rapid growth of the recombinant cells as long as
tetracycline is present in the culture medium, thereby repressing expression of
HA-cSMN. However, removal of tetracycline from the medium caused derepression of
the promoter, expression of HA-cSMN, and inhibition of endogenous wild-type cSMN
expression in the cell. Thus, the cell line provides a stable genetic system that can be
easily manipulated to emulate the lack of SMN exhibited by the cells of SMA patients,
thereby providing a useful system for study of the mechanisms associated with SMA
and for identifying compounds useful for the treatment of SMA.

Decreased or absent levels of SMN in the cell of the invention impaired
cell growth and led to eventual cell death, which was presumably mediated by
impairment of cellular processes in a manner analogous to the disease process
demonstrated by SMA patients. Therefore, the recombinant cell line containing
decreased levels of SMN protein is a useful model for SMA, which is characterized by
reduced, or absent, amounts of SMN. Further, the cell line expressing altered, e.g.,
decreased or absent, levels of SMN provides an important tool for identifying
compounds useful for treatment of SMA. In addition, the cell line can be transfected
with mutant, variant, and/or homologous forms of SMN such that the effects of change
in SMN can be identified and characterized, providing a system for the study of
naturally occurring and/or genetically engineered selected forms of SMN.

Example 8: SMN knock-out cell

To facilitate the study of the function of SMN in cells and to develop a
system for studying the role(s) of SMN in cell processes as well as for testing potential
therapeutic approaches for SMA, a cell line is generated that lacks an endogenous
nucleic acid encoding SMN but which comprises an exogenous nucleic acid encoding
SMN under the control of an inducible promoter. More specifically, a chicken pre-B
lymphoid cell line, i.e., DT40, which exhibits a high degree (i.e., approximately
1,000-fold higher than normal) of homologous recombination, is used to generate a cell
line lacking a nucleic acid encoding chicken SMN. Although other cell lines can be used
to generate an SMN "knock-out" cell line, DT40 provides the advantage that SMN is
present in the cell as a single allele. Thus, unlike the mammalian genome which
comprises centromeric and telomeric alleles of SMN, as discussed previously
elsewhere herein, the DT40 cell line comprises a single genetic locus comprising a
nucleic acid encoding SMN. Therefore, although other cell lines can be used, DT40
provides an advantageous system for the creation of SMN knock-out cell lines.

This cell line, wherein the endogenous sequence encoding SMN is
"knocked-out", is then stably transfected using a plasmid vector comprising an isolated
nucleic acid encoding chicken SMN under the control of an inducible promoter. This
system allows the rapid growth of the recombinant cells as long as the promoter is
induced and SMN is expressed in the cells. However, repression of the promoter
inhibits production of SMN and the cells eventually cease growth in culture and/or
demonstrate altered growth characteristics as discussed elsewhere herein. Thus, the
cell line provides a stable genetic system that can be easily manipulated to emulate the
lack of SMN exhibited by the cells of SMA patients, thereby providing a useful system
for study of the mechanisms associated with SMA and for identifying compounds
useful for the treatment of SMA.

Decreased or absent levels of SMN in the cell of the invention impair
cell growth, which is mediated by impairment of cellular processes in a manner
analogous to the disease process demonstrated by SMA patients. Therefore, the
recombinant cell line containing decreased levels of SMN protein is a useful model for
SMA, which is characterized by reduced, or absent, amounts of SMN. Further, the cell
line expressing altered, e.g., decreased or absent, levels of SMN provides an important
tool for identifying compounds useful for treatment of SMA.

In addition, the cell line can be transfected with mutant, variant, and/or
homologous forms of SMN such that the effects of change in SMN can be identified
and characterized, providing a system for the study of naturally occurring and/or
genetically engineered selected forms of SMN.

The disclosures of each and every patent, patent application, and
publication cited herein are hereby incorporated herein by reference in their entirety.

While the invention has been disclosed with reference to specific
embodiments, it is apparent that other embodiments and variations of this invention
may be devised by others skilled in the art without departing from the true spirit and
scope of the invention. The appended claims are intended to be construed to include all
such embodiments and equivalent variations.